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ABSTRACT 


Some  examples  of  linear  ill -posed  problems  in  engineering  are 
given  and  a general  class  of  regularization  methods  for  ill-posed 
linear  operator  equations  is  studied.  Rates  of  convergence  for  the 
general  method  are  established  under  various  assumptions  on  the  data. 
Applications  are  given  to  a number  of  iterative  and  noniterative 
regularization  algorithms. 
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FOREWORD 

This  report  describes  work  performed  in  the  Air  Force  Flight 
Dynamics  Laboratory  during  the  summer  of  1978.  The  research  was 
supported  by  the  Air  Force  Office  of  Scientific  Research  through 
the  USAF-ASEE  Summer  Faculty  Research  Program  (WPAFB) , Contract 
F44620-76-C-0052,  The  Ohio  State  University  Research  Foundation, 
Columbus,  Ohio. 
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SECTION  I 
INTRODUCTION 

The  concept  of  a well-posed  problem  was  formulated  by  Hadamard 
early  in  this  century.  In  broad  terms,  a problem  is  well-posed  in 
the  sense  of  Hadamard  if  it  has  a unique  solution  which  depends  con- 
tinuously on  the  data  of  the  problem.  Specifically,  if  T is  a trans- 
formation from  a metric  space  X into  a metric  space  Y,  then  the  prob- 
lem 

Tx  = b (1) 

is  said  to  be  well-posed  if 

( i)  for  each  beY  there  is  a solution  xcX, 

( ii)  the  solution  x is  unique,  and 

(iii)  the  solution  x depends  continuously  on  the  "data"  b. 

A problem  which  is  not  well-posed  is  called  "ill-posed."  Ill- 
posed  problems  have  been  intensively  studied  during  the  last  fifteen 
years,  especially  by  Soviet  mathematicians  (see  {12], [23]),  because 
of  their  importance  in  many  engineering  applications  (see  [11]  and 
[14]  for  specific  areas  of  Air  Force  interest).  In  this  report  we 
will  be  concerned  with  linear  ill-posed  problems,  that  is,  we  will 
study  the  problem  (1)  where  T is  a linear  operator  on  Hilbert  space. 

A typical  problem  of  this  type  is  the  integral  equation  of  the  first 
kind 


l 


(2) 


d 

J'  k(s,t)x(s)ds  = b(t) 
a 


2 

where  the  kernel  k is  a member  of  L ( [a,d]x[a,d] ) (the  space  of 

Lebesgue  square  integrable  functions  on  the  rectangle  [a,d]x[a,d]) 

2 

and  beL  [a,d]  (we  allow  a or  d to  be  infinite).  Such  equations  are 
notoriously  ill-posed.  For  example,  if  k(s,t)  = t + c,  then  (2)  can 
have  a solution  only  if  b is  a linear  function,  violating  (i).  If 
k(s,t)  = sin(s)  and  b(t)  = 2,  then  by  the  well-known  orthogonality 
relations. 


IT 


fUs 
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t)  (1  + sin(ms))ds  = b(t),  m 


2,3,... 


which  violates  (ii) . Far  more  serious  is  the  fact  that  (iii)  is 
violated  for  equations  of  type  (2).  Indeed,  by  the  Riemann-Lebesgue 
lemma,  for  arbitrary  A, 


1 

/k(s.t) 

o 


A sin 


(mirs)ds  -*■  0 


as  m 


and  hence  solutions  do  not  depend  continuously  on  the  data. 

Numerical  methods  for  analyzing  ill-posed  linear  problems  are 
particularly  important  because  a large  number  of  engineering  prob- 
lems have  the  form  (2).  Consider  for  example  the  one  dimensional 
heat  equation 


.in 


m 


2 


32u  3u 


u(x,0)  = h(x) . 


It  is  well  known  that  the  temperature  distribution  f (x)  = u(x,T)  at 
some  time  T > 0 can  be  expressed  in  terms  of  the  initial  temperature 
distribution  h(x)  by 

00 

f(x)  = — — f exp(-(x— r)2/(4T))h(r)d  /. 

2/nT-  J 
—00 

The  "inverse"  problem  of  determining  the  initial  temperature  distri- 
bution h(x),  given  the  distribution  f(x)  at  the  later  time,  is  of 
considerable  interest  and  is  an  ill-posed  problem  of  type  (2) . 

Another  problem  of  type  (2)  is  the  numerical  differentiation 
problem.  The  nth  derivative  of  a given  function  b(t)  (with  b(0)  = 
b'(0)  = ...  = b^n  ^ (0)  = 0)  satisfies 


t 

f (n-~l)~!~  ^t-s)n~lx^s)ds  = MO. 

0 

This  problem  has  been  studied  extensively  within  the  context  of  ill- 
posed  problems  by  Cullum  [3],  Franklin  [5]  and  others. 

Another  example  is  afforded  by  the  work  of  Lee  [13]  and  Pro- 
vencher  [18]  on  the  determination  of  the  molecular  weight  distribution 
of  a solute  from  centrifuge  data.  In  this  example  the  molecular 
weightvdistribution  f(m)  satisfies 


U(x)  * f 
0 


, 2 2 -Amx 
A m e 


f (m)dm 
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where  U is  a function  which  is  proportional  to  the  measured  con- 
centration gradient  and  A is  a constant  which  is  proportional  to  the 
square  root  of  the  rotor  speed. 

As  a final  example,  we  give  the  two  dimensional  integral  equa- 
tion 


f P(x%y0  dx^dy ^ = 

y(x-x')2  + (y-y')2]’"2 


fj_  (x,y)  - f2(x,y) 


which  was  studied  by  Singh  and  Paul  [21]  and  concerns  the  pressure 
distribution  in  the  contact  of  nonconforming  elastic  bodies. 

Integral  equations  of  the  first  kind  also  arise  in  the  deter- 
mination of  the  shape  of  conducting  bodies  from  backscattered  elec- 
tromagnetic radiation  ([16], [17]),  seismic  prospecting  [2],  antenna 
theory  [4],  remote  probing  of  the  atmosphere  ([22], [24]),  medical 
tomography  [6]  and  system  identification  ([!],[ 15 ] ) . 


SECTION  II 

GENERALIZED  INVERSES 

We  will  henceforth  assume  that  and  are  Hilbert  spaces  and 
that  T:H^  ■>  is  a bounded  linear  operator.  The  inner  product  and 
norm  in  each  space  will  be  denoted  by  (•,•)  and  ||»||,  respectively. 
The  range  and  nullspace  of  T will  be  denoted  by  R(T)  and  N(T),  re- 
spectively. Our  task  is  to  solve  the  ill-posed  problem  (1)  for 
xeH^  given  beH^.  Of  course,  if  b^R(T)  then  (i)  is  violated  and  there 
is  no  solution.  In  such  a case  we  might  reasonably  adopt  the  more 
flexible  attitude  of  replacing  b in  the  right  hand  side  of  (1)  by 
the  point  in  R(T)  which  is  nearest  to  b.  However,  if  R(T)  is  not 
closed,  such  a closest  point  may  not  exist.  We  are  then  led  to  accept 
as  a generalized  solution  any  vector  ueH^  which  satisfies 

Tu  = Pb  O) 

where  P is  the  projection  of  H ^ onto  R(T) , the  closure  of  R(T).  Any 
vector  u satisfying  (3)  is  called  a least  squares  solution  of  equa- 
tion (1) . We  note  that  a least  squares  solution  will  exist  for  any 
vector  b whose  projection  onto  R(T)  lies  in  R(T) , i.e.,  for  all  vec- 
tors  b in  the  dense  subspace  R(T)  © R(T)  of  It  is  not  diffi- 

cult to  see  that  least  squares  solutions  may  also  be  characterized 
as  vectors  ueH^  which  satisfy  either  of  the  conditions 


|Tu  - b | | <_  | | Tx  - b | | , for  all  x e H^,  (4) 


r 

where  T is  the  adjoint  of  T (see  [7]  for  a proof  of  this  and  other 
simple  facts  pertaining  to  this  section). 

We  have  seen  that  if  we  consider  least  squares  solutions  in- 
stead of  traditional  solutions,  then  difficulty  (i)  is  to  a certain 
extent  obviated.  The  problem  of  nonuniqueness,  however,  remains  at 
this  point.  Indeed,  if  N(T)  ^ {0}  then  there  may  be  infinitely  many 
least  squares  solutions,  for  if  u is  a least  squares  solution,  then 
so  is  u+v  for  any  veN(T).  Fortunately,  there  is  a natural  way  of 

| 

selecting  a least  square  solution  which  is  unique  in  a certain  sense. 

We  see  from  (5)  that  the  set  of  all  least  squares  solutions  is  a 
closed  convex  set.  This  set  therefore  contains  a unique  vector  of 
smallest  norm  and  it  is  this  vector  which  we  will  accept  as  the 
unique  generalized  solution  of  equation  (1).  Let  $(T^)  = R(T)  © R(T). 
The  operator 

T+:0(T+)  -*■  Hx 

% 

which  associates  with  each  b e$(T+)  the  unique  least  squares  sol- 
ution of  equation  (1)  with  minimal  norm  is  called  the  generalized 

*|* 

inverse  of  T.  It  is  not  difficult  to  show  that  T is  a closed  linear 

operator  (see  [7]).  If  T^  were  continuous  then  problems  (i),  (ii) , 

+ 

(iii)  would  be  solved,  at  least  for  b£j5(T  ).  But  alas  this  is  not 
the  case.  It  is  not  difficult  to  show  that  T+  is  continuous  if  and 
only  if  R(T)  is  closed.  Unfortunately  the  range  of  an  integral  op- 
erator is  closed  if  and  only  if  its  kernel  is  degenerate  (see  [7]). 

•4* 

We  are  therefore  led  to  seek  approximations  to  T'  by  bounded  linear 


I 

I I 

1 

d 


6 


fl 
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SECTION  III 


A GENERAL  METHOD 


* * 

We  will  denote  the  operator  T T by  T and  the  operator  TT  by 

f.  Note  that  T and  T are  self  adjoint  linear  operators  whose  spec- 

2 

tra  lie  in  the  interval  [0,||t||  ].  If  0 t o(T)  (the  spectrum  of 
- t --1  * 

T),  then  by  (5)  we  have  T * T T . In  general,  however,  0 e o(T), 

but  this  last  equation  nevertheless  leads  us  to  seek  approximations 
+ -A 

to  T by  operators  of  the  form  U(T)T  where  U is  a continuous  func- 
tion  on  [0,||t||  ] which  approximates  the  function  f(t)  * t in 
some  sense.  Specifically,  we  will  consider  a family  (net)  of  real 
valued  functions  {U0(t):BeS},  indexed  by  a subset  S of  the  positive 

P 

2 

real  numbers  with  »>eS,  where  each  Ug  is  continuous  on  [0,||t||  ] 

and  such  that 


| tU  ( t ) | < M for  all  t and  6 

P 


yt)  - 1 


The  following  is  proved  in  [7]. 


as  8 -*■  00  for  each  t f 0.  (7) 


Theorem  1.  Suppose  be$(T^)  and  let  Xg  = Ug(T)T  b.  Then  Xg  -*  T^b 


To  this  we  now  add, 

Theorem  2.  If  b^/J(T+),  then  (x„)  has  no  weakly  convergent  subnet. 

tJ 

Proof.  Suppose  {x  >}  is  a subnet  of  (x  } which  converges  weakly 

B 8 

to  zeH, , denoted  x0^  5 z.  By  the  weak  continuity  of  bounded  linear 

1 P 

operators  we  then  have  Txq * * Tz. 


- - ■*— . 


PRECEDING  PAGE  BLANK* NOT  FILMED 


Now, 


Pb  - Tx„  = Pb  - TU0 (T)T*b 

B p 

= Pb  - TU  (T)Pb . 

p 

/s  A 

However,  by  (6)  and  (7),  the  operator  TU  (T)  converges  pointwise  to 

p 

-I  * X 

the  projection  of  ^ onto  N(T)  = N(T  ) = R(T).  Therefore 

Pb  - Tx  , -*>  0.  It  then  follows  that  Pb  = Tz,  a contradiction.// 

p 

s'c 

In  the  proof  above  we  have  used  the  fact  that  U,.(T)T  = 

p 

* » 

T U (T) . This  is  easy  to  see  if  UQ  is  a polynomial.  In  the  general, 
case  the  identity  follows  from  the  Weierstrass  approximation  theorem. 
Using  the  fact  that  bounded  sets  in  Hilbert  space  are  weakly  com- 
pact, we  have: 

Corollary  3.  If  ) , then  | jx^  | ) -*■  <®  as  B 00 . 

Theorem  1 and  Corollary  3 demonstrate  dramatically  the 

unequivocal  nature  of  the  approximations  (x  }. 

p 

Several  authors  have  established  rates  of  convergence  for 
various  approximations  to  T^b  under  the  stronger  assumption  that 
Pb  e R(T)  (see  [20] , [9] , [10] ) . We  see  from  Corollary  3 that  the 
very  least  we  must  require  to  get  convergence  at  all  is  that 
Pb  z RfT).  In  order  to  strengthen  this  condition  only  slightly  and 
thereby  obtain  a rate  of  convergence  we  note  that 

R(T)  - R(TP  ) 

NOT 

and,  in  the  pointwise  sense, 

10 


k 


P L = lim  T . 

N^T>  v-*0+ 

It  therefore  seems  reasonable  to  replace  the  hypothesis  b e$(T^), 
i.e.,  Pb  e R(T) , by  the  hypothesis  Pb  e R(TTV)  for  some  v > 0.  In 
order  to  gauge  the  rate  of  convergence  we  will  replace  (7)  by  the 
stronger  condition 

tv|l  - tUg(t)|  <_  w(B,v)  for  v > 0 (8) 

where  u>(8,v)  + 0 as  8 -*  <*>  for  each  v > 0 (the  case  v = 1 was  con- 
sidered in  [8]). 

-v  x 

Lemma  4.  If  v > 0,  then  R(T ) C.  N(T)  . 

Proof.  Suppose  {f^}  is  a sequence  of  continuous  real  valued  func 

tions  on  [0 , { | T | j 2 ] such  that  f n ( t ) -*■  tV  1 for  t # 0 and  cfn(c)  is 

uniformly  bounded  (for  example,  we  may  take  fn(t)  = tV  ^ for  t >_  1/n 
2— v 

and  f (t)  = n t for  0 < t < 1/n).  Let  {E.}  be  the  resolution  of 
n — — t 

the  identity  generated  by  the  self-adjoint  operator  T.  By  the 
bounded  convergence  theorem  we  then  have 

.v  ,IMI22  ,IItII2 

T y = J t dE  y = J lim  tf  (t)dE  y 

0 C 0 n n C 

J|t||2  . - X 

= lim  / tf  (t)dE  y = lim  Tf  (T)y  e N(T)  .it 

_ n t n 

n 0 n 

We  now  state  a rate  of  convergence  result.  The  vector 


T^b  will  be  denoted  by  x and  the  error  x - x.  by  e . 


Theorem  5.  If  Pb  = TTVw,  where  v > 0,  then  | | | | <_  w(B,v)  | |w|  | . 
Proof.  Since  Tx  = Pb  = TTVw  and  since  x - TVw  e N(T)  , we  see 

that  x = TVw.  Now, 


xB  = Ug (T)T*b  = Ug(T)T*Pb 

_ - - \v4-1 

* Ue(T)Tx  = Ug(T)T  w. 

Therefore  e.  = x - x.  =*  TV(I  - U0(T)T)w. 
p p p 

By  the  Spectral  Mapping  Theorem  and  Radius  Formula,  we  then  have 


I |e0 | | 1 w(B,v) | |w| | . # 


In 

assumptions  on 
Lemma  6,  If 
Proof.  As 


Therefore 

Theorem  7.  If 
Proof.  In 


our  next  result  we  become  more  cavalier  in  our 
the  data. 

Pb  = TVw  where  v >_  1,  then  | | e^  | <_  u)(S,v-l)  | | Te ^ | | | |w|  | 

& v— 1 

in  the  previous  proof  we  find  that  x = T T w.  Also, 
xQ  =■  U„  (T)T*Pb  = U_d)T*TVw 

P P P 

= T*U  (T)TVw. 

e^  = x - x0  = T*(I  - Ub(T)T)TV_1w,  and 

I |eg | | 2 - (ee,T*(I  - Ug(T)T)TV-1w) 

- (Teg,(I  - Ub(T)T)TV_1w)  < o)(S,v-1)||w||  J | Teg | J . 
Pb  = TVw  where  v >_  l,  then  | ( e g | f 2 <_  oj(B,  v)oj(3,  v-1)  | |w(  | . 
Lemma  6 we  saw  that 

eg  = T* (I  - Ug(T)T)fV-1w, 


therefore 

Te  = T*TV(I  - U (T)T)w. 

P P 

We  then  have 

||Te6||2  = (Te6,e6)  = (TV(I  - Ug(T)Tw,Teg) 

<_  w(B,v)  | |Teg|  | , i.e.,  J | Teg  | | <_ui(8,v). 

Substituting  into  the  result  of  Lemma  6 completes  the  proof.// 

In  the  next  section  we  will  give  a number  of  examples  of 
specific  computational  techniques  to  which  the  above  results  apply. 

We  have  avoided  for  long  enough  the  problem  of  polluted 
data.  We  now  take  up  this  question.  Suppose  that  the  data  b is  the 
result  of  measurements  so  that  instead  of  b we  have  in  our  possession 
a corrupted  version  b£  satisfying  ||b  - be||  £ e.  We  operate  on  the 
vector  be  to  obtain  the  approximations  xG  given  by 

p 

x*  - Ug(T)T*be. 

Let  <H0)  ■ sup{  I tUfl(t)  | :t  e [0,||t||2]},  and  recall  that  <K6)  is 

P 

bounded  (by  (6)). 

Lemma  8.  | |Txg  - Txg  | | <_  e 4> (B)  - 

Proof.  T(x.  - x * ) - TU0 (T)T*(b  - bG) , therefore 

P P P 

I ITxb  - TxbII2  = (*(xe  ■ x8Kx8  " x8} 

= (TUg(T)T*(b  - be),xg  - xG) 

- (TUg (T) (b  - bC),T(xg  - xg)) 
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note  that 


1 *(B)||b  - bel|  | |T(x0  - xp\\ 
- 1 e<K0)||Txg  - TxE | | . // 


Suppose  now  that  g(g)  = sup{ ju. (t) I : t e 

P 


CO, I |t| I 2] } . 


We 


g(B)  -*•  00  as  0 -*■  <».  (9) 

Indeed,  if  this  were  not  the  case,  then  there  would  be  a constant  L 
such  that  |USU)|  < L for  all  t and  6.  But  then  | tU  (t)  J - <_  Lt  -*■  0 
as  t -*■  0,  contradicting  (7). 

Lemma  9.  | |x&  - xE|  | <.  e/g (B)*(B). 

Proof.  Since  Xg  - xE  = T Ug(T)(b  - be)  , we  have,  by  use  of 
Lemma  8 , 

ll*g  - Xgll2  - (Xg  - Xg,T*Ug (T) (b  - bE)) 

- (T(Xg  - xE),Ug(T)(b  - be)) 

< e2d>(0) g(0).# 

Suppose  now  that  Pb  = Tw  (we  could  also  use  the  other 
hypotheses  considered  above,  but  we  choose  to  consider  this  simple 
case  to  illustrate  the  ideas).  By  the  triangle  inequality  we  have 

I lX  - Xgl I 1 | |x  - Xg| I + I |Xg  - Xgl I . 

Lemma  9 and  Theorem  7,  then  give 
Theorem  10.  If  Pb  = Tw,  then 


:_r  " ^ 
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I lx  - Xgl  I 1 (I  |w|  |u(6,l)w(8,0))Ji  + e(g(6)<K3))!i. 


The  first  term  x>n  the  right  hand  side  of  this  inequality 
goes  to  zero  as  D + ®.  However,  by  (9)  and  (7),  the  second  term  be- 
comes infinitely  large  as  M This  illustrates  the  classic  di- 
lemma in  the  numerical  treatment  of  ill-posed  problems.  Even  if 
computations  are  performed  exactly,  small  errors  in  the  data  may 
eventually  grow  and  overpower  the  approximations. 

In  view  of  Theorem  10,  the  question  naturally  arises  as 
to  whether  it  is  ever  possible  to  obtain  convergent  approximations 
even  if  the  data  can  be  measured  as  precisely  as  desired.  Specifi- 
cally, is  there  an  effective  way  of  choosing  a "stopping  parameter" 
6(e)  such  that  e , . + 0 as  e + 0?  This  problem  of  choice  of  reg- 
ularization  parameters  is  of  great  import  and  still  has  not  been 
satisfactorily  answered.  For  the  wide  class  of  methods  considered 
here  the  question  is  particularly  difficult,  for  as  we  shall  see  in 
the  next  section,  the  parameter  may  take  on  discrete  or  continuous 
values  depending  upon  the  specific  method  under  consideration. 


SECTION  IV 


SPECIFIC  METHODS 


In  this  section  we  will  consider  some  specific  choices  for  the 
functions  {U _ ( t ) } and  we  will  find  functions  u)(8,v)  which  determine 

P 

rates  of  convergence.  The  index  set  S in  all  examples  below  will  be 
either  the  set  of  nonnegative  reals  or  nonnegative  integers.  In  the 
discrete  case,  the  parameter  8 will  be  denoted  by  n. 

As  a first  example  we  consider  Showalter's  integral  formula  [19] 


t * 

T b = /exp(-uT)T  bdu. 


The  functions  U for  this  example  have  the  form 

P 

6 

U (t)  = / exp(-ut)du 
B 0 

and  may  be  motivated  in  terms  of  Borel  summability  [7].  It  is  not 
difficult  to  see  that  a function  io(B,v)  satisfying  (8)  is  given  by 


uK8,v)  = 3 


(v  > 0). 


The  choice  U.(t)  = (t  + 3"1)"1  (6  > 0)  leads  to  Tychonov’s  reg- 
P 

ularization  of  order  zero  [23].  Here  one  can  readily  verify  that 


j(S,v)  * 3~  for  0 < v < 1. 


In  order  to  obtain  approximations  with  this  rate  for  v > 1 we  may 
use  extrapolated  regularization  [9].  That  is,  for  a given  3 > 0 we 
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U^0)U)  = (t  + ff1)-1 
and  define  Richardson  extrapolants  by 

= (2ju£j~L)U)  - Ugj"1)(t))/(2:j  - 1), 

j = 1»2,  ...  . 

It  is  not  difficult  to  show  (see  [9,  lemma  2.1])  that  for  k = 0,1,2, 


tk+1  j i - tu^(t)|  = TT  ( — - ) 


,(k) 

e 


< 8 


i=0  2 It  + 1 
-k-1 


Therefore,  for  the  kth  extrapolant  we  may  apply  Theorem  7 with 

oj(8,k)  = 8-k~1,  k = 1,2,...,  to  obtain  the  rate  8 k+2  (see  [9,  The- 

' • 

orem  3.2]) . 

We  now  consider  some  iterative  regularization  methods.  Below, 

__ 2 

a will  be  a parameter  satisfying  0 < a < 2 j I T ( | 

If  the  functions  U (t) , n - 0,1,2,...  are  defined  by 
n 

n k 

U (t)  -a  £ (1  - at) 

n k=0 

then  (6)  and  (7)  are  satisfied  and  one  can  show  that 

nvtv|l  - tUn(t)|  = nvtvjl  - at|n+1 

is  uniformly  bounded.  From  this  we  find  that  the  rate  of  conver- 
gence of  the  iterative  process 

Xo  = aT*b’  Xn+1  = (I  ' ai’)Xn  + aT"b 
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is  determined  by  the  function  ui(n,v)  = n V. 

Newton’s  method  for  approximating  t ^ leads  to  the  sequence  of 
functions  defined  by 

Ve)  ' »*  W'’  ' V'>12  - 'V'”' 

For  this  sequence  of  functions  it  is  not  difficult  to  see  that 

tV|l  - tU  (t)|.-  0(2~Vn)  for  v > 0. 

Therefore  the  rate  of  convergence  of  the  corresponding  iterative 
method  is  determined  by  the  function  w(n,v)  = 2 Vn. 

Showalter  and  Ben-Israel  [20]  have  extrapolated  on  the  previous 
method  to  obtain  methods  with  a higher  rate  of  convergence.  For  a 
positive  integer  p >_  2 they  define  the  hyperpower  methods  in  terms 
of  the  sequence 

P*1  k 

U (t)  - ct,  U (t)  = U (t)  I (1  - tU  (t))\ 

* o n+JL  n . „ n 

k=0 

For  these  methods  the  results  above  may  be  used  to  obtain  the  con- 
vergence rate  0(p  Vn) . 

In  [11]  Lardy  considered  the  approximations 

xo  * 0,  Txn  + xn  = xn_x  + T*b * " = 1.2,... 

to  T^b,  where  T is  an  unbounded  operator.  We  may  apply  the  results 
above  in  the  case  of  a bounded  operator  if  we  define  the  functions 


U(t)  = I (t  + i)”\ 
n ktl 

One  can  verify,  as  in  the  first  iterative  example  above,  that  the 
function  w(n,v)  = n_V  determines  a rate  of  convergence. 

The  iterative  method 


* 

x = T b, 
o 


ti+1 


= X 


+ (T  b 


Tx  )/(n 
n 


+ 2), 


was  investigated  in  [10].  The  appropriate  functions  are  given 
by 

n . k-1 

DU)*  I(k  + 1)  IT  a - + J)>- 

n k=0  j=0 


This  leads  to  the  iterative  method 


x T b,  xn+l  xn 


x l,  = x_  + (T  b - Txn)/(n+2). 


Following  the  analysis  given  in  [10]  one  can  show  that  the  rate  of 
convergence  of  this  method  is  governed  by  the  function  (o(n,v)  = 


r 
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FOR  TRANSONIC  FLUTTER  CALCULATIONS 
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Abstract 

An  inp  licit  finite-difference  method  was  developed  for 
solving  the  full  time-dependent  potential  equation  of  aerodynamics. 
The  method  was  shown  to  be  unconditionally  stable  and  of  second 
order  accuracy  in  time  and  space.  The  intended  use  of  the  method 
is  in  a simultaneous  solution  of  the  aerodynamic  and  structural 
equations  under  transonic  flutter*  conditions.  The  method  was 
programmed  and  numerical  results  indicate  the  usual  sensitivity 
of  high  order  methods  to  disturbances  produced  by  a shock  wave. 
Several  reccrmendations  are  made  to  remedy  the  situation.  Among 
them  is  a suggestion  on  the  correct  use  of  the  tri-diagonal  al- 
gorithm of  the  Alternating  Directions  Implicit  rrethod  in  tran- 
sonic flows  where  the  domain  of  dependence  of  the  solution  changes 
as  a function  of  the  local  Mach  number.  Further  development  of 
the  method  is  needed  to  render  it  practical  for  flutter  work. 

* 
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SECTION  I 
' INTRODUCTION 

Tlie  recent  interest  in  transonic  flutter  created  a demand 
for  more  accurate  and  efficient  ccnputational  procedures  for  un- 
steady transonic  aerodynamics.  Hie  present  work  is  an  attenpt 
to  develop  such  a procedure  subject  to  the  requirements  that  the 
numerical  scheme  be:  (1)  an  approximation  to  the  full  potential 
equation,  (2)  of  second  order  accuracy  in  both  time  and  space, 

(3)  applicable  to  high  frequencies,  (4)  not  limited  to  thin  bodies, 
(5)  easily  extendable  to  three-dimensional  wings  and,  (6)  be  un- 
conditionally stable. 

Earlier  attenpts  at  transonic  flew  ccnputations , as  reported 
in  literature,  lead  to  ccrp  remises  so  that  not  all  of  the  above 
six  conditions  were  satisfied.  The  difficulties  of  simultaneously 
meeting  all  these  requirements  have  been  partially  overcome  in  the 
present  work  by  application  of  central  differencing  in  conjunction 
with  the  method  of  factorization  (the  Alternating  Directions  Im- 
plicit method  or  A.D.I.) 

The  ocnputaticnal  method  presented  here  requires  further 
refinements  and  modifications.  Computed  results  show  poor  shock 
capturing  ability  with  flew  field  fluctuations  characteristic  of 
higher  order  methods.  Without  an  artificialy  added  numerical  dis- 
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sipatian  the  method  is  shown  to  be  unconditionally  neutrally 


(i.e.  marginally)  stable  and  of  second  order  accuracy.  Thus 
all  six  requirements  have  been  met;  however,  the  method  does 
not  appear  at  the  present  stage  of  development  to  be  suitable 
for  transonic  calculations  and  several  reccmnendaticns  are  made 
in  SECTION  VI  to  remedy  the  obvious  short  comings. 


SECTION  II  gives  a general  procedure  for  deriving  a factored 
scheme  for  the  potential  equation  of  aerodynamics  which  is  hyper- 
bolic and  of  second  order  in  both  time  and  space.  In  SECTION  III 
the  numerical  stability  of  the  linearized  (constant  coefficients) 
finite-difference  equation  is  considered,  while  seme  computational 
aspects  of  the  alternating  directions  method  are  discussed  in 
SECTION  IV.  The  treatment  of  the  Kutta  trailing  edge  and  of  the 
wake  conditions  is  explained  in  SECTION  V. 

The  main  contributions  of  the  work  reported  here  are  the 
development  of  a nunerical  scheme  of  high  accuracy,  both  in  time 
and  in  space,  applicable  to  the  full  time-dependent  potential 
equation  not  limited  to  thin  bodies  or  low  frequencies,  and  an 
extension  of  the  tri-diagonal  algorithm  for  integration  in  al- 
ternating directions  to  integration  across  the  wake  and  across 
the  imbedded  supersonic  zone.  A novel  idea  of  a type-dependent 
use  of  the  method  of  Alternating  Directions  is  presented  in 
SECTION  VI. 
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SECTION  II 

ALTERNATING  DIRECTIONS  IMPLICIT  SCHEME 

Vfe  shall  consider  the  full  potential  equation  for  two- 
dimensional  irrotational  flows. 


9y 


.a2<q+£|). 


3x  dy 

nondimensionalized  by  referring  u,  v,  a to  U^,  x,  y to  chord  c, 
the  velocity  potential  <j>  to  U c,  and  time  t to  cAJ  , where 

00  oor 

a2  = 1/M2  - (y-1)  [|£  + J5(u2+v2-l)  ] 
is  the  nondimensional  speed  of  sound,  and  u = d<p/dx,  v = 34>/3y - 


Introducing  centered  time  differences,  and  with  t = nAt, 


4>(t)  = 4>(nAt)  = 4>n,  we  have 


<Dn+1-  2<pn  + .j)11'1  + At(u^  + v^)  («n+1-  4>n_1) 

+ At2(u29?  + 2UVW  + V2^?)<t>n 

= At2a2  y + — — *•)  <j>n  + 0(At4) . 


9x  9y' 


This  may  be  rewritten  as 





Without  affecting  the  order  of  accuracy,  we  add  to  the  above 
the  term  (reasons  for  doing  so  are  explained  in  SECTION  IV) 


= 0(At4) . (2 


New  consider  the  following  expressions,  the  first  of  which 
provides  seme  danping  (dissipation)  and  the  second  ccnpletes 
the  products  of  differential  operators : 

- 2aAt2Ax2Ay2(^2  + 

dx*  dyZ  3t 

= - 2aAtAx2Ay2(^l-  + ^L)  (^1.  ^ + 


(3 


and 


Aj_2.__  32  /j.n+1  ^n.ji-l*  . At4/_21..2w_2.„2t  34  ,j*H.ji-1% 

At  uvg^g-  (0  -20  +0  ) + jg-  (a  +u  ) (a  +v  ) — ^ — y(0  +0  ) 

y - 3x  3y 

- At2aAx2Ay2  (u— — T + v-L — + cAx2Ay2-3!  ,)  (<^1+<j>n~1) 

3x3y  3x3y  3x3y 


At!(u(a2+V2)^v  + v(a2+u2)^l_}  . 


3x3y 


3x^3y 


(4) 


O O yi  a 

If  aAtAx^Ay^  = 0(At*) , then  both  expressions  (3)  and  (4)  are  0(At4) 
Adding  (3)  and  (4)  to  Equ. (2)  permits  one  to  factor  the  spatial 
operators  as  follows: 


2 2 

(Qx-aAtAx2Ay2^-?)  (Qy-0AtAx2Ay2i-?)  4>n+1  - A<pn 

2 2 

+ (R+aAtAx2Ay2|— j)  (lycAtAx^y^-j)  d*11'1  = 0 


where 


At2  2,  2. 32 


Qx  = ;L-^(a+U)7T+uAt^, 
2 

_ , Ac  , 2 2,3Z  . 3 

Rx  - 1 - -5— (a  +u  ) — j - uAt^  , 

Qy  = 1 - ^-(a 2+v2)^t  + vAt-9 

Ry  = , . ^(a2+v2)^  - vAt^  , 


^ ' 


At2 , , 2 „ 2,  . 32 


2 -2.  ,3" 


A = 2 + ==-[  (a  -3u  0 )— =■  + (a  -3v  5 )— y] , 
Z x 3x^  y 3y^ 


(5) 


and  where  the  symbols 
fication  resulting  in 
e.g. , use  here  finite 


<5x  and  6^  indicate  a further  possible  modi- 
increased  numerical  dissipation.  We  may, 
difference  averaging  operators  which  replace 
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the  value  of  the  mesh  function  at  a given  point  by  a weighted 
sum  (average)  of  its  neighbors . 

In  order  to  maintain  a second  order  temporal  accuracy,  the 
dissipation  parameter  a should  be  kept  proportional  to  At^/Ax^Ay^. 
Spatial  accuracy  depends  on  the  approximation  to  the  spatial 
partial  derivatives.  With  centered  spatial  differences  second 
order  accuracy  with  respect  to  space  is  assured. 

Implementation  of  the  Alternating  Directions  method  of 
solution  of  the  factored  scheme  (5)  is  treated  in  SECTION  IV. 


SECTION  III 


STABILITY  ANALYSIS 

We  shall  limit  the  study  of  stability  to  the  case  of  linear 
difference  equations,  that  is,  to  the  finite-difference  analog 
of  Equ.  (5)  with  u,  v,  and  a kept  constant. 

A linear  difference  equation  admits  a fundamental  solution 
in  the  form 


= <J>(nAt,  jAx,  kAy)  = <f>0e 


i(na  + j0  + ky) 


where  a/At  = complex  frequency,  and  where  0/Ax  and  y/Ay  are  the 
oatponents  of  the  wave  number  vector.  We  note  the  following  ' 
shift  property  of  the  fundamental  solutions, 

n+N  _ , ia.N.  i0.  J iy  K ,n  _ JJ  J K.n 
*jW,k+K  - (e  > (e  > <e  > 

For  stability  in  the  von  Neumann  sense  we  require  that  |£|  £ 1. 
Further,  we  observe  that  the  anplification  factor  £ and  the  unit 
vectors  n and  Q are  eigenvalues  of  the  shift  operators,  and  that 
finite  differences  are  formed  by  taking  differences  of  the  mesh 
function  <f>(nAt,  jAx,  kAy)  with  its  arguments  shifted  (increased 
or  decreased  by  integer  multiples  of  the  increments  At,  Ax,  Ay) . 
Thus  = £“■*■({) , and  the  centered  spatial  differences  are 


»n 


- HHL.V  - A 


d<p  „ 1 /An  An  x _ n-n  An  _ ,i  An 

3x  " 2Et(*j+l,k“*j-l,k)  " 2ST  ♦jk  = (Zxsul6)<t>jk' 
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t±  - 1 /*n 


-1 


,An  _OJn  ^n  x _ ry-2+n  .n 

i?  - ^?(W 4 VW  - 17-  ♦*  - 


and  similarly. 


d2<P 


2 - - ^(l-ooar)*"k. 


Sy  Ay 

SX  - 5wj+1,ic«^"-i,k>  ’ 5(1+OTs6»»",k  ' 

Sy  - 4(+j,k+l+2*jk+*?,k-l)  = la+ODSY)^,k  • 


- -^(l-ooe8)*"k. 


Substituting  the  fundamental  solution  into  Equ.  (5)  we  have  with 

the  notation  r = At/Ax,  r = At/Ay,  and  with 
x y 

Qjj  = 1 + *sr2(a2+u2)  (1-0058)  + iuTxsin8, 

Qy  = 1 + *sr2(a2+v2)  (1-cosy)  + ivl^siny , 

Rx  = 1 + Jsr2(a2+u2)  (1-oosB)  - iuI^sinP  = Qx, 

Ry  = 1 + hT2(a2+v2) (1-cosy)  - ivl^siny  = Q^, 

A = 2 { 1 - *r2  [a2-|u2(l+cosB)](l-cos8) 

- V2  [a2-|v2  (l+oosy)  ] (1-cosy) } , 

(Q  + e)C  - A + (R  - e)£_1  = (Q  + e)C  - A + (Q  - e)?-1  = 0.  (6) 


where  the  overbar  indicates  a ccnplex  conjugate,  and  where 
Q = QxQy  + 4o2At2Ax2Ay2 (l-cos8) (1-cosy)  = (^  + i Q^, 

e = 2aAt[QxAy2(l-cosy)+Q^Ax2(l-cos8) ] = er  + ie^- 
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Equation  (6)  is  a polynomial  with  complex  coefficients  and  of 

second  degree  in  £.  The  coefficients,  and  therefore  also  the  roots 

of  the  polynomial,  depend  on  eight  parameters,  viz.  on  At,  Ax,  Ay, 

2 

u,  v,  a and  the  angles  3 and  y.  We  are  unable  to  solve  the  com- 
plex polynomial  for  £ in  a closed  form.  However,  we  may  make  use 
of  the  standard  techniques  of  the  stability  theory.  We  first 
multiply  the  quadratic  (6)  by  its  complex  conjugate  polynomial , 

(Q  + e)£2  - A£  + (Q  - e)  = 0, 

to  obtain  a polynomial  with  real  coefficients, 

[QO+Qe-K2e+ee]C4  - [Q+QH:+e]A£3  + [A2-K22-K22-(e2+e2)  ]£2 
- [Q+Q- (e+i)  ]AS  + [QQ-Qe-Qe+ee]  = 0, 

and  then  transform  the  5-plane,  using  5 = (l+w)/(l-w) , into  the 
w-plane  where  the  stability  condition,  |£|  <_  1,  transforms  to 
Re(w)£  0 . The  transformed  polynomial  is  of  the  form 

4 3 2 

aow  + a^w  + a2w  + a3w  + a4  = 0 (7) 

where  aQ  = (A-H2+Q) 2 + 4e2  > 0, 
a1  = 8 [Qe+Qe  + JsA(e+e)], 
a2  = 2 [A2  + (Q+Q) 2 + 4QQ  + 4ee  + 4e2]  > 0, 
a3  = 8[Qe-K3e  - JjA(e+e)  ] , 
a4  = [A  - (QK1)  ]2  + 4e2  > 0. 

Hie  numerical  dissipation  term,  e , which  was  added  to  Equ. (2) , 
has  the  effect  of  modifying  the  unconditional  neutral  (marginal) 


stability,  for,  with  e = 0 the  Routh-Hurwitz  sufficient  condi- 
tions for  stability  are  in  the  present  case: 

Ao  = ao  > °'  A1  = al  = °'  a2  = ala2~aoa3  = °' 

A3  = ala2a3~ao(ala4'a3)  = °'  A4  ~ a4A3  = 0- 

Thus  Re(w)  = 0 or  |£|  =1  unconditionally  with  zero  dissipation 
(e  = 0)  and  the  scheme  has  no  damping  or  amplification. 

The  addition  of  the  damping  term,  while  it  lowers  the  order 
of  temporal  accuracy  (unless  a « At2/Ax2Ay2) , modifies  the  co- 
efficients aQ,...,a4  of  the  transformed  polynomial  by  an  addition 
of  quadratic  terms  e2,  e2  to  the  even-numbered  coefficients 
while  making  the  odd-numbered  coefficients  proportional  to  the 
magnitude  of  e.  Reversing  the  sign  of  e changes  the  sign  of  a^ 
and  a3.  Since  the  necessary  condition  for  Re(w)  < 0 is  that  all 
the  coefficients,  aQ,...,a4,  be  positive,  the  sign  of  e (e.g. 
that  of  a in  Equ.  (7))  must  be  chosen  so  that  a.j+a3  = 16 
be  positive;  but  this  does  not  yet  quarantee  that  both  a^  and  a^ 
are  separately  greater  than  zero. 

Due  to  the  algebraic  complexity  the  full  set  of  inequalities, 
the  necessary  stability  conditions  aQ>0,  a^O,...,  a4>0  and  the 
Routh-Hurwitz  sufficient  conditions  Aq>0,  A^O,...,  A4>0  could 
not  be  given  in  a closed  form.  We  note,  however,  that  the  even 
numbered  coefficients  aQ,  a2,  and  a4  in  Equ. (7)  are  positive 


t 


definite.  Consequently  the  two  inequalities  a^>0  and  a^>0  are 
of  primary  concern  to  us.. 

Upon  a closer  examination  of  the  expression  for  33  the 
following  conclusions  were  reached.  With  0 — 2ttAx/\,  X = mAx 
= wavelength  of  disturbance,  m = 2,3,4,...,  thus  3 = 2ir/m; 
similarly,  y = 2-rr/n  where  n = 2,3,4,....  Short  waves  (e.g., 
rn=2  and/or  n=2)  are  definitely  always  stable  and  highly  danped, 
the  latter  property  being  very  desirable  as  it  prevents  the 
appearance  of  "wiggles"  and  the  "up-and-down"  oscillations  of 
the  numerical  solution  in  presence  of  shock  waves,  Long  waves, 
m,n  ■*  <»,  are  only  marginally  stable.  Again,  this  is  desirable. 
However,  certain  combinations  of  large  but  finite  wavelengths 
may  change  the  sign  of  depending  on  At,  the  Mach  nuntoer  and 
the  mesh  aspect  ratio  Ar  = Ay/Ax.  Numerical  evaluation  of  the 
roots  of  the  complex  polynomial  (6)  confirmed  this  conclusion. 
Instability,  |S|  > 1,  occurs  only  at  supersonic  points  and  for 
long  waves,  and  it  seems  to  require  the  combination  of  medium 
and  long  wavelengths,  e.g.,  8Ax  and  256Ay,  or  16Ax  and  128Ay, 
etc.  These  conclusions  were  confirmed  by  supercritical  flew 
calculations  on  a 56  « 32  point  mesh  where  the  shortest  observed 
oscillations  had  a wavelength  of  8 Ax.  These  oscillations  dis- 
appeared on  a coarser  mesh  due  to  the  boundary  conditions.  This 
is  a classical  case  of  a "long  wave  instability"  which  may  occur 
as  the  nesh  is  continuously  refined  until  the  unstable  long  waves 
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fit  between  the  boundaries,  that  is,  when  the  number  of  subdivisions 
of  the  computational  mesh  equals  the  half-wavelength  divided  by  the 
mesh  spacing,  or  Jjn  in  our  notation. 

Due  to  the  marginal  daitping  and/or  instability  of  the  long 
waves,  the  shock  wave  thickness  is  large,  on  the  order  of  the  half 
wavelength  of  the  waves  with  largest  amplitude  (that  is,  of  those 
having  the  least  damping) . The  dispersion  properties  of  such  waves 
should  be  studied  with  the  objective  of  modifying  the  computational 
scheme  so  as  to  improve  the  situation. 

The  obvious  changes  in  the  computational  scheme  that  offer 
hope  of  improving  the  performance  of  the  method  are  the  conservative 
differencing  and  the  type-dependent  differencing.  Since  the  shock 
wave  creates  the  major  disturbance  in  the  solution,  discrete  handling 
of  the  shock  waves  would  remove  most  of  the  difficulties  encountered 


in  the  present  work. 


• • 


SECTION  IV 

THE  COMPUTATIONAL  FORM 
OF  THE  FINITE  DIFFERENCE  SCHEME 

Using  centered  spatial  differences  we  put  the  scheme  (5) 
into  a form  emphasizing  one-dimensional  character  of  the  Alter- 
nating Directions  Iitp licit  method  (ADI) . Let 

- [atoV^atv2)  I }' 


Fjk  = Gjk  " S^{J5u(Gj+l,k'<3j-l,k) 


H.,  — 2<pn.  + 5 (4>n,  .-201?  . ■ -hp1}  , . ) 

Dk  Dk  2Ax 2 vy]+l,k  yj,k  y;|-l,k' 

_ *LAt^((}n  n ^n 

8 Ax2  3+2 ,k  3,k  3~2,k 

+ a ~(<pn  -2<pn  , -hpn  , ,) 

2Ay2  3'k+1  9 j ,k^j  ,k-lJ 

2 2 

3v  At  ,An  ^An  lAn  x 


With  the  solution  given  at  the  two  time  levels,  t = (n-l)At  and 
t = nAt,  the  functions  Gjk,  F^,  and  are  kncwn.  Equation  (5) 
takes  then  the  form 

xjk  + tl('5u(xj+l,k'xj-l,k) 

- [^y2^i(a2«2)I(X.+1(^2Xjk«..lik)}.Hj)t-Fjk  (8) 


13 


where  the  auxiliary  function  X_.^  is  defined  by 

xjk  - 

- [C4x24rt§(a2«2)  ] «"ki-2<jSj !k-l>  > • 

Equation  (8)  contains  the  unknown  X evaluated  only  at  three 
points  allowing  for  the  use  of  the  efficient  tri -diagonal  na.tr ix 
inversion  algorithm.  We  write  (8)  as 


(9) 


-V 

j+l/k  + BjXj»k  ~ 

CjXj-l,k 

•n 

Q 

II 

! A. 

3 

= aAtAy2  + 

4AX2 

(a2+u2)  - 

uAt 

22x 

Bi 

= 1 + 2aAtAy2  + 

^<a2+u2). 

J 

2Ax 

cj 

= aAtAy2  + 

4Ax^ 

(a2+u2)  + 

uAt 

2Ax 

D: 

= Hjk  " Fjk* 

(10) 


The  solution  of  Equ. (10)  for  X^k  is  to  be  followed  inmediately 
by  the  application  of  the  tri-diagonal  algorithm  to  Equ. (9)  written 
as 

n+1  n+1  n+1 

" Vj/k+l  + Vj  ,k  “ Ck4> j ,k-l  " Dk' 

2 

where  ^ = aAtAx2  + -^j(a2+v2)  - ^ , 


4Ay 


» 1 + 2aAtAx2  + -^-.(a2+v2)  , 

2Ayz 

°k  * °W  + * W ' “*  °k  * V 
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We  note  at  this  point  the  advantage  of  the  addition  of  the 

proper  terms  used  to  obtain  Equ. (2)  in  SECTION  II.  If  this 

2 2 

were  not  done,  the  terms  containing  the  factors  (a  +u  ) and 
2 2 

(a  +v  ) would  be  absent  from  the  expressions  for  the  coeffi- 
cients A,  B,  C given  above.  The  importance  of  retaining  such 
terms  follows  from  the  properties  of  the  tri-diagonal  algorithm. 
In  order  that  the  round-off  errors  of  the  tri-diagonal  solver 
not  grow  exponentially,  one  must  require  that  A > 0,  B > 0, 

C > 0 and  B > A+C.  Without  the  aforementioned  terms  the  first 
inequality  (or  the  third,  depending  on  the  direction  of  flow) 
could  be  violated  especially  for  small  dissipation  parameter  a. 
Computations  with  a = 0 would  not  be  possible.  With  the  ad- 
dition of  the  proper  terms  leading  to  Equ. (2)  the  round-off 
errors  will  be  bounded  if  we  require  that  the  Courant  number  C 

n 

satisfy  the  inequality 


This  condition  is  sufficient  and  not  unduly  restrictive  since 
2TV(1+M2)  _<  1 and  the  objective  of  implicit  method  is  to  avoid 
the  restriction  of  the  Courant-Friedrichs-Levi  stability  criterion 

C < 1. 

n — 
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SECTION  V 

THE  KUTTA  AND  WAKE  CONDITIONS 


The  inposition  of  the  Kutta  trailing  edge  condition  and  the 
wake  condition  presents  a special  problem  in  the  Alternating 
Direction  Method,  namely,  that  two  conditions  must  be  met  simul- 
taneously in  the  middle  of  the  interval  of  integration  in  the 
direction  normal  to  the  wake. 

In  practice,  the  trailing  edge  is  usually  located  between 
two  mesh  points  so  that  the  Kutta  condition  and  the  wake  condi- 
tion lead  to  the  same  physical  requirement  that,  for  all  points 
downstream  of  the  trailing  edge,  there  be  no  pressure  difference 
across  the  wake  and  that  the  mass  flow  across  the  wake  be  con- 
tinuous. For  simplicity,  we  take  the  wake  as  lying  in  the  plane 
of  the  wing.  Mathematically,  the  Kutta  and  wake  conditions 
become 

PU  = PL'  (a^D  = 0<f>/3  y)L' 

where  the  subscripts  ( )y  and  ( ) L denote  sides  of  the  wake 

corresponding  to  the  upper  and  lower  surfaces  of  the  airfoil, 
respectively. 

Using  the  exact  relations  for  an  irrotational  flow, 

YP/P  - YP*^  = a2  - £ - (Y-lXff  + M (|£>2*  (^.)2-  1JJ 
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and  the  continuity  of  the  normal  ccrrponent  of  velocity  we  obtain 
as  a condition  of  continuity  of  pressure  across  the  wake 

ft(*U  " V + “ (H}L}  = °* 

This  may  be  rewritten  as 

{!t  + hl  + ^L1  (l?}  }%  - *L  > = °*  (11) 

Thus  the  jump  in  the  potential,  A<J>  = ^ - <pL,  propagates  along 
the  wake  with  the  instantaneous  local  mean  velocity  Um, 

Um  + 

and  represents  the  vorticity  shed  fran  the  trailing  edge.  The 
potential  jurrp  is,  in  general,  a function  of  time  and  position 
in  the  case  of  oscillating  airfoil,  is  constant  in  space  in  the 
lew  frequency  approximation  and  constant  in  tine  in  the  steady 
flow  case.  The  jump  vanishes  only  for  a nonlifting  airfoil. 

The  continuity  of  the  normal  velocity  ccrrponent  may  be 
irtposed  conveniently  by  requiring  that  the  adjusted  potential, 

<P'  = <P  for  y<0,  4>'=<j>-A<t>  for  y > 0,  be  continuous  together 
with  its  first  derivatives  across  the  wake.  This  will  facilitate 
handling  of  the  boundary  conditions  in  the  y-direction  step  of 
the  ADI  method. 

If  the  accuracy  of  the  finite  difference  approximation  to 
Equ. (11)  is  not  of  primary  concern,  one  may  finite-difference 
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in  the  potential  being  carried  unchanged  along  the  vortex  sheet 
with  the  mean  velocity  having  new  two  ccrpanents  in  the  plane  of 
the  wing. 

Hie  treatment  of  the  wake  suggested  here  is  only  approxi- 
mate since  the  wake  does  not  remain  in  the  plane  of  the  wing. 

A discrete  handling  of  the  wake  as  a moving  discontinuity,  while 
it  offers  elegance  and  accuracy,  would  soon  lead  to  ccnplications 


due  to  a natural  instability  of  a vortex  sheet  which  has  a tendency 
to  roll  up  into  large  vortex  structures.  Out  of  two  evils  we  suggest 
to  use  the  smaller. 


SECTION  VI 


^RECOMMENDATIONS 


There  are  two  obvious  shortccmings  of  the  proposed  numerical 
scheme  at  this  stage  of  its  development,  namely,  the  neutral  sta- 
bility with  no  artificial  dissipation  (a  = 0) , and  the  poor  re- 
solution of  shock  waves.  Further  investigation  of  the  effects 
of  adding  higher  order  dissipative  terms  without  long-wave  desta- 
bilization of  the  scheme  should  be  carried  out  systematically 
with  the  objective  of  retaining  a second  order  accuracy  in  both 
time  and  space  while  assuring  stability  independently  of  Mach 
number,  mesh  size  and  the  coordinate  system. 

Conservative  differencing  is  necessary  to  capture  the  shock 
wave  motion  properly.  The  present  method  should,  therefore,  be 
applied  to  the  divergence  (conservative)  form  of  the  potential 
equation, 

a2 

It  [|£+  W)2]  + V-u|i+  Js(V4>)2-  = 0, 

which  is  equivalent  to  Equ.  (1)  when  expanded.  However,  since 
finite  difference  operators  applied  to  nonlinear  terms  do  not 
commute,  additional  terms  would  appear  in  the  finite  difference 
equations  (8)  and  (9) . Conservative  differencing  conserves 
fluxes  of  conserved  quantities  and  this  reduces  truncation  errors 
across  discontinuities  which  errors  govern  the  amplitudes  of  the 


r" 

oscillations  of  the  solution  at  the  shock. 

Conservative  differencing  alone  would  not,  however,  improve 
handling  of  the  shock  waves  sufficiently.  A type-dependent 
scheme  (upstream  differencing  at  supersonic  points)  could  be 
used  to  inprove  shock  capturing  and  the  shock  wave  resolution. 
Conservative  differencing  would  not,  per  se,  remedy  the  stabi- 
lity problem  since  the  stability  analysis  of  the  linearized 
finite-difference  equation  would  not  be  affected  by  conservative 
differencing. 

The  author  is  unaware  of  any  numerical  experiments  and/or 
theoretical  arguments  for  or  against  the  following  idea.  In 
order  to  eliminate  the  unwanted  downstream  dependence  in  super- 
sonic regions,  the  x-direction  (streanwise)  step  of  the  ADI 
method  could  be  broken  up  into  three  steps  (the  y-ccmponent  of 
velocity  will  ordinarily  remain  subsonic) . In  a transonic  flow 
the  line  y = constant  will  cut  the  imbedded  supersonic  zone 
at  two  points.  Custanarily , this  fact  is  ignored  and  two  boundary 
conditions  are  used,  one  at  the  inlet  (x  -*■  -»)  and  one  at  the 
outlet  (x  ■+■  +°°)  boundaries  of  the  ccnputational  region.  The  tri- 
diagonal algorithm  calculates  the  intermediate  solution  in  the 
x-  direction  ADI  step  subject  to  these  boundary  conditions  treat- 
ing the  problem  as  a typical  two-point  boundary  value  case.  If, 
instead,  the  y = constant  line  is  divided  into  three  segments, 
a subsonic  region  between  x = and  the  sonic  line,  followed  by 
a supersonic  region  between  the  sonic  line  and  the  shock  wave, 
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and  another  subsonic  region  between  the  shock  and  x = + °°,  then 
each  of  the  subsonic  regions  represents  a two-point  boundary 
value  problem  solvable  by  the  tri-diagonal  algorithm,  while  in 
the  supersonic  region  we  need  two  initial  conditions  at  the  sonic 
line  (e.g. , the  value  of  the  potential  and  its  x-derivative) , 
and  a recursion  formula,  essentially  given  by  Equ.  (10) , would 
provide  a solution  not  depending  on  the  upstream  boundary.  The 
solution  in  the  supersonic  region  would  yield  an  ini ticil  condi- 
tion for  the  continuation  of  the  solution  in  the  subsonic  region 
downstream  of  the  shock.  A simple  modification  of  the  ADI 
procedure  nay  thus  accomplish  what  the  upstream  differencing 
technique  had  as  a sole  monopoly  - namely,  that  of  imposing  cor- 
rectly the  rule  of  forbidden  signals.  Quite  possibly,  the  type- 
dependent  ADI  algorithm  proposed  here  could  be  used  successfully 
in  conjunction  with  central  differencing  thus  preserving  the 
second  order  spatial  accuracy. 

The  above  proposed  type-dependent  ADI  method  could  be  easily 
implemented  in  any  existing  program  using  the  tri-diagonal  algorithm. 
It  modifies  the  standard  use  of  the  ADI  method  from  its  customary 
applications  to  elliptic  problems  to  the  solution  of  mixed  and 
hyperbolic  problems. 
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ABSTRACT 


A constant  problem  of  interest  to  the  Air  Force  is  the  design 
of  acoustically  sound  aircraft  structural  components.  This  is  because 
sonic  fatigue  failures  have  resulted  in  unacceptable  maintenance  and 
inspection  burdens  associated  with  the  operation  of  the  aircraft. 

In  some  instances,  sonic  fatigue  failures  have  resulted  in  major 
redesign  efforts  of  structural  components.  Currently,  the  sonic 
fatiaue  design  methods  in  practice  are  completely  based  on  the  linear 
or  small  deflection  theory  (Sonic  Fatigue  Design  Guide  for  Military 
Aircraft,  AFFDL-TR-74-112,  for  example).  But,  on  the  contrary,  the 
test  structural  panels  respond  nonlinearly  with  large  deflections  at 
high  intensity  acoustic  pressure  levels.  This  large  amplitude 
geometrical  nonlinearity  is  identified  to  be  the  major  factor  that 
causes  the  enormous  disagreement  between  the  computed  and  the  measured 
random  responses.  To  improve  the  sonic  fatigue  design  method, 
therefore,  large  deflection  or  nonlinear  structure  theory  has  to  be 
employed  in  the  analysis.  This  paper  first  gives  a thorough  survey- 
type  review  of  various  existing  analytical  and  numerical  methods  on 
random  excitations  of  nonlinear  mul tidegree-of  freedom  systems,  and 
an  evaluation  of  these  methods  based  on  some  realistic  considerations 
from  the  point  of  view  of  their  application  to  complex  panel  configura- 
tions of  aircraft  structure.  These  are  the  Fokker-Planck  equation 
method,  equivalent  linearization  technique,  perturbation  approach, 
finite  difference  method,  finite  element-equivalent  linearization 
approach,  etc.  Then,  a mathematical  formulation,  which  is  based  on 
finite  element  method  and  equivalent  linearization  technique,  for  com- 
plex structural  panels  to  high  noise  environment  is  developed. 
Statistical  responses  of  nodal  deflections  and  element  strains  using 
a single-mode  approximation  are  given  in  terms  of  the  linear  frequency, 
spectral  density  of  excitation  pressure,  equivalent  linear  frequency, 
and  strain-deflection  transformation  matrices.  Extension  of  the  quasi - 
1 inearization  method,  which  has  been  used  successfully  in  analysis 
of  large  amplitude  vibrations  of  complex  structures,  to  the  present 
nonlinear  random  excitation  problem  is  also  discussed  briefly. 
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NOMENCLATURE 


[c] 

Element  damping  matrix 

[C], 

ra. 

Damping  matrices 

CCJ 

Diagonal  generalized  damping  matrix 

e 

Error  of  linearization  or  equation  deficiency 

EC  ] 

Operator  denotes  the  mathematical  expectation 

{f} 

Element  nodal  force  vector 

<F}, 

ms 

Nodal  force  vectors 

9 

Structural  damping  coefficient 

H(n) 

Frequency  response  function 

[k] 

Element  stiffness  matrix 

[K], 

[K]s 

Stiffness  matrices 

ft! 

Diagonal  generalized  stiffness  matrices 

[kg] 

Element  geometrical  stiffness  matrix 

[Kg] 

. [K9]s 

Geometrical  stiffness  matrices 

Keq 

Equivalent  linear  stiffness  constant 

Cm] 

Element  consistent  mass  matrix 

CM], 

[M]s 

Consistent  mass  matrices 

Ml 

Diagonal  generalized  mass  matrix 

{P} 

Force  vector  in  modal  coordinates 

{q} 

Vector  of  node  displacements  normal  to  the  surface  of 
structure 

{g>s 

Nodal  displacement  vector 

[Vk] 

Deflection  covariance  matrix 

— 


[S]r[S]2 

CSF(Q)] 

Sp(fl) 

(6} 

(e} 

[i^l 

C 

X,u 


UJ 

{<»(j)} 

[•] 

u 

“eq 

Q 


Strain  - deflection  transformation  matrices 
Cross  spectral  density  matrix  of  {F} 

Spectral  density  function  of  P 
Element  nodal  displacement  vector 
Element  strain  vector 
Strain  covariance  matrix 
Damping  ration 

Proportionality  constant  between  damping  and  stiffness 

and  inertia,  respectively 

Amplitude  vector  in  modal  coordinates 

j-th  normalized  eigen  vector 

Modal  transformation  matrix 

Linear  undamped  frequency 

Equivalent  linear  frequency 

Angular  frequency 
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I.  INTRODUCTION 

The  response  of  outside  surfaces  or  skins  of  an  aircraft 
structure  to  high  intensity  acoustic  pressure  levels  has  been  the  sub- 
ject of  considerable  research  effort  (References  1 and  2,  for  example). 
This  complex  problem  may  separate  to  three  parts: 

(1)  Prediction  of  the  acoustic  loading, 

(2)  Determination  of  the  response  of  complex  structural  panels 
to  this  excitation,  and 

(3)  Estimation  of  the  fatigue  life. 

There  are  considerable  data  available  to  predict  the  acoustic 
loads  on  an  aircraft  structure  due  to  the  many  possible  sources  of 
high  sound  levels.  These  sources  are  normally  classified  as  to  pro- 
pulsion system  noise  sources  and  aerodynamic  noise  sources.  Noise 
prediction  methods  for  various  sources  have  been  summarized  in  two 
excellent  reports  (References  2 and  3). 

Basic  design  considerations  and  procedures  to  estimate  the 
fatigue  life,  various  cumulative  damage  theories,  and  fatigue  curves 
describing  the  S-N  characteristics  for  various  materials  are  given  in 
References  1 and  2.  Fatigue  design  data  for  bonded  aluminum  struc- 
tures can  be  found  in  Reference  4. 

The  present  study  concentrates  on  the  response  aspects  of  the 
problem,  even  there  is  considerable  overlapping  of  these  three  areas. 
Special  effort  is  given  to  incorporate  the  nonlinear  response  effect, 
which  is  mainly  due  to  large  deflections,  into  the  analysis.  The 
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reason  for  this  is  simple  because  there  is  enormous  discrepancy 
between  the  measured  and  the  computed  responses.  Those  computed 
results  were  based  on  linear  or  small  deflection  theory  while  test 
panels  responded  nonli nearly  with  large  deflection  at  sufficiently 
high  intensity  noise  levels.  These  documented  evidence  are  summarized 
briefly  in  Section  II. 

In  approaching  this  problem  of  random  excitation  of  nonlinear 
systems,  one  can  turn  to  a number  of  prior  investigations.  There- 
fore, an  important  task  of  this  study  is  thus  to  review,  classify, 
and  evaluate  these  existing  techniques.  The  evaluation  is  based  on 
some  realistic  considerations  from  the  point  view  of  their  application 
to  complex  panels  of  aircraft  structure.  These  are  presented  in 
Section  III.  And  it  becomes  evident  from  the  survey  that  more 
research  is  needed  in  the  nonlinear  random  vibration  area. 

Section  IV  gives  a mathematical  formulation,  which  is  based  on 
the  finite  element  method  and  the  equivalent  linearization  approach, 
for  complex  structures  subjected  to  random  acoustic  excitation.  An 
iterative  scheme  used  in  the  solution  procedure  and  a simplified  flow 
chart  are  also  presented.  Ratio  of  nonlinear  root  mean-square  (RMS) 
deflection  to  linear  RMS  deflection,  that  can  be  expressed  in  terms 
of  linear  frequency,  equivalent  linear  frequency,  and  power  spectral 
density  (PSD)  of  the  excitation,  is  given.  A very  crude  estimate  on 
the  improvement  of  predicting  responses  using  large  deflection  theory 
is  also  included  in  this  section. 


Section  V gives  a review  of  the  advances  in  the  development  of 
geometrical  stiffness  matrices  for  various  finite  elements.  These 
matrices  are  required  in  the  nonlinear  random  vibration  analysis. 

A "Quasi-Linearization"  method  which  is,  at  present,  in  the 
conceptual  phase  is  discussed  briefly  in  Section  VI.  Conclusions  and 
recommended  future  work  are  given  in  Section  VII. 


II.  EVIDENCE  OF  LARGE  AMPLITUDE  NONLINEARITY 


Some  aspects  of  current  knowledge  about  the  response  of 
structural  panels  to  high  intensity  noise  are  discussed  in  this 
section.  Experiments  and  tests  on  aircraft  structural  components 
have  displayed  behavior  that  is  not  consistent  with  linear  or  small 
deflection  theory  assumptions.  The  deviations,  which  differ  for 
various  structural  configurations,  are  suggestive  of  two  types  of 
nonlinearity  sources: 

(1)  Nonlinear  damping  ratio,  and 

(2)  Nonlinear  relation  between  displacement  and  strain. 

Commonly  used  methods  for  determining  damping  ratio  are  the 

bandwidth  method  by  measuring  half-power  widths  at  modal  resonances, 
and  the  decay  rate  method  by  measuring  the  logarithmic  decrement  on 
decaying  modal  response  traces.  The  values  of  damping  ratio  range 
generally  from  0.001  to  0.025  for  common  panel  constructions  used 
in  aircraft  structure.  For  such  relatively  small  damping  coefficients, 
it  is  a reasonable  assumption  that  the  effects  to  the  structural 
behavior  due  to  nonlinear  damping  would  be  also  small  in  comparison 
with  due  to  large  amplitude.  This  has  been  an  observed  evidence  in 
many  experiments  and  is  discussed  and  summarized  in  the  following. 

Many  documents,  for  example  references  5 to  8,  have  repeatedly 
reported  poor  comparison  between  measured  and  calculated  RMS  responses. 
They  all  observed  that  the  test  panels  responded  with  large  deflec- 
tions at  high  sound  pressure  levels,  and  the  computed  responses  were 
based  on  linear  small  deflection  theory.  This  is  the  major  reason 


that  causes  the  huge  discrepancy  between  measured  and  calculated 
results.  This  experimental  evidence  was  taken  from: 

(1)  ASD-TDR-62-26  (Reference  5) 

Fitch  et  al  observed  the  nonlinear  stress  response  at  rela- 
tively low  siren  excitation  level  for  conventional  skin-stringer 
panels  as  shown  in  Figure  1.  Their  explanation  to  this  nonlinear 
behavior  was  the  diaphragm  action  which  limits  the  amplitude  of 
deflection  of  the  vibrating  plates.  Examination  of  Figure  1 indi- 
cates that  as  the  excitation  level  is  raised  above  104  dB,  the 
structural  response  begins  deviating  away  from  the  linear  assumption. 

(2)  AFFDL-TR-68-44  (Reference  6) 

Three  types  of  typical  aerospace  structures  were  tested  at  an 
average  sound  pressure  level  (SPL)  of  157  dB.  They  are  a skin- 
stringer  3-bay,  a honeycomb  sandwich,  and  a corrugated  sandwich  panel. 
Measured  RMS  deflection  on  the  skin-stringer  panel  is  surprisingly 
large  of  0.064  inch,  or  twice  the  skin  thickness.  Measured  honey- 
comb panel  RMS  deflection  is  0.031  inch,  or  1.4  thickness  of  the  top 
skin.  Comparisons  of  RMS  deflection  showed  that  calculated  skin- 
stringer  panel  deflection  is  34-percent  high  and  calculated  honey- 
comb panel  deflection  is  5-percent  high  only.  Comparisons  between 
calculated  and  measured  RMS  stresses  were  poor.  They  are  shown  in 
Table  1 taken  from  Reference  6. 
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TABLE  1 - STRESS  COMPARISON 


Panel  type 

Stress 

component 

RMS  stress  (kpsi) 

Calculated 

Measured  on  panel : 

Measured 

average 

A 

B 

C 

D 

E 

Skin-stringer 

•Ja  2 

X 

7.7 

2.2 

2.9 

2.5 

... 

2.2 

2.5 

\l  cr  2 

V 

2.4 

0.63 

0.94 

0.78 

1.1 

0.84 

0.87 

Honeycomb 

2.6 

2.0 

1.8 

1.2 

1.2 

1.3 

1.5 

JFT 

y 

2.1 

1.3 

1.3 

0.91 

0.84 

1.3 

1.1 

Strain  components  were  also  measured  with  rosettle  strain  gauges 
mounted  on  both  the  upper  and  lower  surfaces.  Appreciable  membrane 
stress  was  recorded,  this  implies  the  panels  responding  with  large 
deflections.  The  stress  levels  measured  were  actually  very  low, 
showing  that  the  nonlinearity  is  not  associated  with  yield  or 
plasticity  of  the  material,  but  rather  with  coupling  of  inplane  and 
out-of-plane  displacements. 

(3)  AFFDL-TR-71 -1 26  (Reference  7) 

Three  nine-bay,  cross-stiffened,  graphite-epoxy  panels  were 
exposed  in  a broad-band  166  dB  SPL  acoustic  environment.  Under 
loud-speaker  excitation,  the  lowest  natural  frequency  was  at  174  Hz. 
But  under  the  acoustic  pressure  in  the  progressive  wave  test  chamber, 
the  lowest  frequency  increased  from  200  Hz  to  290  Hz  as  the  SPL  was 
increased  from  139  dB  to  166  dB.  The  increase  in  natural  frequency 
with  increasing  pressure  level  is  attributed  to  the  large  deflections 
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of  the  panel  response.  Again  strain  comparisons  were  poor  as  shown 
in  Table  2 taken  from  Reference  7.  And  deflection  was  not  measured. 


TABLE  2 - COMPARISON  OF  EXPERIMENTAL  RESULTS 
OF  CROSS-STIFFENED  PANELS  WITH  RESULTS 
USING  UNSTIFFENED  PLATE  THEORY 


1 • 

Fundamental 

Strain  at 

Strain  at 

Approach 

Method 

Frequency 

x = o*  y = 2 

x = |,  y = 0 

(Hz) 

(micro-inch/inch-rms) 

(micro-inch/inch-rms) 

Analytic 

Simplified  theory 
(Beam  Functions) 

182 

180(2) 

406(2) 

Analytic 

Finite  Element 
(REDYN) 

180 

180(2) 

348(2) 

Experi- 

mental 

Test  Panel 
A-GG-B-2 

187(1) 

96(4) 

160(3) 

Expe  ri- 
me ntal 

Test  Panel 
A-UU-B-3 

170(1) 

74(4) 

164<3> 

(1)  Obtained  during  damping  factor  determination  under  loudspeaker  excitation 

-6  2 

(2)  Strain  response  to  fully  correlated,  white  noise  excitation  of  1.2  x 10~  psi  /Hz 

(3)  Strain  gage  No.  2 reading  during  139  db  run 

(4)  Strain  gage  No.  7 reading  during  139  db  run 

(4)  AFFDL-TR-77-45  (Reference  8) 

Total  of  ten  bonded  aluminum  panels  were  tested.  Table  3 taken 
from  Reference  8 gives  the  ratio  of  the  fundamental  frequency  obtained 
under  the  loud-speaker  excitation  to  the  frequency  at  which  the  peak 
strain  PSD  occurred  in  the  sonic  fatigue  test  at  166  dB  overall  SPL. 
Again,  the  agreement  between  the  test  strains  and  the  predicted 
strain  was  poor,  as  shown  in  Table  4 (taken  from  Reference  8)  and  no 
measured  deflection  was  reported. 


Therefore,  a conclusion  can  be  reached  that  any  real  structure 
designed  for  service  in  high  sonic  environment  regions  would  respond 
with  large  deflection  nonlinearity. 

TABLE  3 - FREQUENCY  AND  DAMPING  DATA 


PANEL 


Average 


0.018 

0.015 

0.011 

0.009 

0.012 

0.011 

0.023 

0.009 

0.012 

0.014 


0.0134 


NATURAL  FREQUENCIES 

fIrst 

MODE 

SECOND 

MODE 

T^IRD 

MODE 

FOURTH 

MODE 

(Hz) 

(Hz) 

(Hz) 

(Hz) 

137 

189 

265 

340 

97 

147 

179 

266 

144 

209 

284 

386 

141 

203 

2 79 

376 

165 

245 

- 

- 

141 

210 

295 

413 

80 

114 

155 

226 

84 

124 

172 

235 

103 

155 

211 

292 

99 

153 

215 

300 

(1)  Nondimensional  viscous  damping  factor,  — 

(2)  The  parameter  f was  the  frequency  of  £he  predominant  strain  response 
at  166  dB  overaliSPL. 


TABLE  4 - STRAIN  PREDICTIONS  BASED  ON  THE  ASSUMPTION 
OF  FULLY  CLAMPED  EDGES  OF  A PLATE 


OVER- 

ALL 

SPL 

PRESS- 

URE 

PSD 

RMS 

STRAINS 

GAGE 

NO.  2 

GAGE  NO.  4 

GAGE  NO.  11 

TEST 

PRE- 

DICTED 

TEST 

Pre- 

DICTED 

( 

TEST 

Pre- 

DICTED 

(dB) 

(psi2/Hz) 

(H”/") 

(H’"/") 

(H’7M) 

(n’7") 

142 

5.0  x 10'6 

32 

445 

27 

111 

84 

270 

145 

3.4  x lO'3 

75 

766 

110 

192 

178 

466 
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III.  REVIEW  OF  EXISTING  APPROACHES 
ON  NONLINEAR  RANDOM  VIBRATION 


Methods  used  to  model  structural  systems  can  be  basically 
divided  into  analytical  methods  and  numerical  methods.  Analytical 
approaches  are  usually  focused  at  obtaining  explicit  closed-form 
quantitative  results  for  simple  structural  configurations.  For 
complex  structures  such  as  found  in  aircraft  design,  numerical  methods 
must  invariably  be  employed  to  accurately  model  the  complex  configura- 
tion. However,  analytical  methods  can  generally  provide  more  insight 
and  lead  to  a better  understanding  of  the  problem.  And  other  approxi- 
mate and  numerical  techniques  can  then  be  extended  from  there.  In 
this  section,  both  analytical  and  numerical  approaches  for  solving 
random  excitation  of  nonlinear  systems  are  reviewed. 

1 . Analytical  Methods 

a.  Fokker-Planck  Equation  Approach.  The  most  general  extension 
of  the  Fokker-Planck  equation  method  to  the  multi  pi e-degree-of- 
freedom  (mul tipi  e-DOF)  systems  of  nonlinear  second  order  equations 
is  due  to  Caughey  (References  9 and  10).  One  great  advantage  of  this 
method  over  all  of  the  other  approaches  is  that  it  gives  an  exact 
solution.  However,  this  should  not  conceived  that  all  problems  re- 
lating to  the  response  of  nonlinear  systems  with  random  excitation  have 
been  solved.  By  the  fact,  exact  solutions  of  the  steady-state  pro- 
bability function  have  only  been  found  for  certain  restricted  classes 
of  problems  provided: 


(1)  The  only  energy  dissipation  in  the  system  arises  from 
damping  forces  which  are  proportional  to  the  velocity, 

(2)  The  exciting  forces  are  uncorrelated  Gaussian  white 

noise, 

(3)  The  spectral  density  matrix  of  the  excitation  is 
proportional  to  the  damping  matrix  of  the  system,  and 

(4)  The  restoring  force  vector  of  the  system  is  derivable 
from  a potential . 

The  solution  of  the  time-independent  Fokker-Planck  equation 
under  these  conditions  represents  a very  significant  accomplishment. 
Problems  of  simple  structures  which  satisfy  these  four  conditions 
were  solved  by  Herbert  (Reference  11  and  12).  Vet,  many  problems  of 
practical  interest  do  not  satisfy  those  conditions  necessary  for  a 
solution.  The  required  relationship  between  the  excitation  and  the 
damping  matrix  is  particularly  restrictive.  In  addition,  the 
transitional  probability  density  function  generally  can  not  be  found 
with  the  Fokker-Planck  approach.  Without  this  transitional  probab- 
ility, it  is  generally  impossible  to  obtain  the  correlation  function 
and  PSD  of  the  response.  Thus,  a number  of  approximate  techniques 
have  been  developed  to  treat  a broader  class  of  problems  than  is 
presently  possible  with  the  exact  analysis.  These  are  the  equivalent 
linearization  technique,  perturbation  method,  etc. 

b.  Equivalent  Linearization  Approach.  This  method  was  originated 
by  Krylov  and  Bogoliubov  (Reference  13).  Caughey  (Reference  14)  has 
extended  the  equivalent  linearization  technique  to  systems  of  non- 


linear  differential  equations.  In  his  formulation,  the  correlation 
function  matrix  of  the  excitation  must  be  diagonalized  by  the  same 
transformation  that  diagonalizes  the  linear  mass,  damping,  and 
stiffness  matrices.  This  represents  a rather  severe  limitation  and 
in  particular  precludes  the  application  of  this  formulation  to  dynamic 
systems  which  are  excited  randomly  at  only  several  nodal  points. 

Lin  (Reference  15)  used  the  equivalent  linearization  method  with  a 
single-mode  approach  and  obtained  response  for  a rectangular  panel 
subjected  to  randomly-varying  loadings.  Seide  (Reference  16)  has 
employed  the  formulation  by  Caughey  and  obtained  solution  for  a simple 
beam  subjected  to  uniform  pressure  excitation  uncorrelated  in  time. 

Foster  (Reference  17),  and  Iwan  and  Yang  (Reference  18)  have 
extended  the  equivalent  linearization  technique  by  removing  the  restric- 
tion imposed  on  the  transformation.  Foster's  formulation  is  much 
general.  He  first  replaced  the  original  n-DOF  second  order  system  by 
a 2n-D0F  first  order  system.  The  determination  of  the  equivalent  linear 
stiffness  coefficients  is  then  accomplished  by  the  inversion  of  a 
2n  x 2n  mean-square  matrix  and  an  interative  procedure.  Pratical 
application  of  this  approach  to  a simple  deep-ocean  tower  frame 
structure  is  reported  in  Reference  19. 

c.  Perturbation  Approach.  A perturbation  method,  based  on 
classical  perturbation  theory,  was  developed  by  Crandall  (Reference 
20)  to  obtain  approximate  solutions  to  nonlinear  systems,  containing 
a small  parameter,  excited  by  a weakly  stationary  random  Gaussian 
processes.  In  principle,  the  perturbation  approach  can  be  extended 
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to  systems  of  coupled  nonlinear  equations  in  which  the  nonlinearities 
contain  a small  parameter.  Lyon  (Reference  21)  used  this  method  to 
study  the  responses  of  a nonlinear  strong.  Tung,  Penzien,  and  Horonjeff 
(Reference  22)  used  the  perturbation  procedure  to  a two  DOF  system. 

For  complex  structures,  however,  the  algebraic  operations  may  become 
so  unwieldy  that  the  method  is  no  longer  practical.  In  addition,  that 
there  are  certain  subtle  questions  about  the  convergence  of  the 
power-series  expansion  for  the  nonlinear  response  still  remaining 
unanswered. 

d.  Other  Approximate  Methods.  Fox  et  al  (Reference  23)  have 
developed  three  new  approaches:  (1)  Direct  Evaluation  of  Spectra, 

(2)  Estimates  of  Equilibrium  Distribution,  and  (3)  Generalized  Kinetic 
Equation.  They  have  applied  the  direct  evaluation  of  spectra  method 
to  a hinged  uniform  beam  with  the  assumptions  that  the  force  spectrum 
is  white  and  that  all  DOF  are  equally  forced  and  have  the  same 
damping  factor.  Extension  of  these  methods  to  complex  structures 
would  certainly  require  considerable  efforts. 

2.  Numerical  Methods 

Numerical  methods  can  be  subdivided  into  two  categories: 
numerical  solutions  to  differential  equations  or  finite  difference 
method,  and  matrix  displacement  method  based  on  discrete  element 
idealization  or  finite  element  method. 

a.  Finite  Difference  Approach.  Numerical  solutions  to  differen- 
tial equations  are  somewhat  restricted  so  that  these  techniques  can 


be  practically  applied  only  to  simple  structural  configurations. 

Belz  (Reference  24)  used  the  finite  difference  approach  and  obtained 
statistical  response  for  a simple  uniform  beam  subjected  to  a single 
concentrated  load  at  the  midspan  of  the  beam. 

b.  Finite  Element  Method.  Application  of  the  finite  element 
methods  to  1 inear  structures  subjected  to  random  excitations  have 
been  presented,  for  example,  by  Jacobs  and  Lagerquist  (References 
6 and  25),  Olson  and  Lindberg  (Reference  26),  Jacobson  (Reference  7), 
and  Olson  (Reference  27).  In  Reference  26,  refinement  on  the 
continuity  between  the  stiffeners  and  the  panel  itself  was  introduced. 
Olson  in  Reference  27  presented  a consistent  formulation  for  the 
cross  spectral  density  matrix  of  the  excitation.  Both  should  give 
improvement  in  the  accuracy  of  predicting  random  responses  for  linear 
structures. 

Application  of  the  finite  element  method  to  deep-ocean  towers 
has  been  given  by  Foster  (Reference  19)  and  to  off-shore  towers  by 
Penzien  et  al  (Reference  28).  In  the  problems  they  solved,  the 
nonlinear  effects  can  be  expressed  explicitly  in  terms  of  the 
displacements  or  velocities.  This  is  not  the  case  for  problems  of 
complex  panel  responses  to  high  intensity  noise.  The  nonlinear  effect 
due  to  large  deflections  or  the  geometrical  stiffness  matrix  is  not 
known  a priori. 

Extension  of  the  finite  element  approach  to  complex  structures 
under  high  noise  environment  with  large  deflection  nonlinear  effect 
is  the  purpose  of  this  research.  A matrix  formulation  which  is  based 
on  the  finite  element  method  and  the  equivalent  linearization 
technique  is  developed  and  presented  in  next  section. 
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IV.  MATHEMATICAL  FORMULATION 

1.  FINITE  ELEMENT  REPRESENTATION 

The  structures  considered  will  be  restricted  to  stable  struc- 
tural systems  which  are  highly  resonant,  that  is,  with  little 
damping.  These  are  typical  properties  for  panel  components  of  air- 
craft structure. 

In  the  matrix  structural  analysis,  structure  is  idealized  into 
a finite  number  of  discrete  structural  elements  connected  at  node 
points.  The  physical  properties  of  the  structure  are  assumed  to  be 
lumped  into  individual  elements.  The  stiffness  equations  of  motion 
for  such  an  element  under  the  influence  of  dynamic  loading,  inertia, 
damping,  elastic,  and  nonlinear  large  deflection  characteristics  are: 

Mfs}+[cJfS)*(M  + [-lk>((S}>])fS}  = If'*'}  «> 

Where  {6}  and{f}  are  vectors  of  nodal  displacements  and  applied 
forces,  respectively.  The  consistent  mass  [m],  damping  [c],  and  linear 
stiffness  [k]  matrices  have  been  developed  for  almost  every  beam, 
plate,  and  shell  elements  available.  The  element  geometrical  stiff- 
nexx  matrix  or  nonlinear  stiffness  matrix  [k^({6})],  which  is  displace- 
ment dependent  and  is  induced  due  to  large  deflections,  will  be 
discussed  in  Section  V. 

By  assembling  all  the  elements,  and  applying  the  kinematic 
boundary  conditions,  the  equations  of  motion  of  the  structure  are: 
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In  which  {q}$  and  {F>s  denote  the  vectors  of  nodal  displacements  and 
forces  of  the  structure.  Matrices  [M]$,  [C]$,  [K]$,  end  [K9]$  are 
the  mass,  damping, stiffness,  and  geometrical  stiffness  coefficients 
of  the  structure,  respectively. 

Applying  the  static  condensation  (or  Guyan  reduction)  and  retain- 
ing only  those  DOF,  say  m of  them,  normal  to  the  surface  of  the 
structure,  Eq.  (2)  may  be  written  as: 

ml)  * ECJfj}  +([K]  + -fF^} 

rnxm  rnxl 

where(qjis  a vector  containing  all  nodal  deflections  normal  to  the 
panel  structure.  The  matrices  [M],  [C],  [K],  and  [K9]  denote  the 
reduced  mass,  damping,  stiffness,  and  geometrical  stiffness,  respec- 
tively. 

2.  DAMPING  REPRESENTATION 

For  certain  forms  of  damping,  the  coupled  nonlinear  equations 
of  motion,  Eq.  (3),  can  be  reduced  to  a set  of  equations  which  contain 
coupling  only  in  the  nonlinear  terms.  This  requires  the  determination 
of  the  eigen  values  and  eigen  vectors  of  the  undamped  linear  system 


in  which  w.  is  the  natural  frequency  and  is  the  corresponding 

J 

j-th  mode  shape  of  the  linear  structure. 

Apply  a coordinate  transformation,  from  the  nodal  displacements 
to  the  modal  coordinates,  by 


rn  xj. 


“l-jrvw-im} 

m*n  n*L 


n^rn 


CS) 
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in  which  {£}  represents  a vector  of  modal  coordinates.  Substituting 
Eq.  (5)  into  Eq.  (3)  and  premultiplying  by  the  transpose  of  [$>], 

Eq . ( 3 ) becomes 

+ r$frK,d?Ki[$]f5}-fP'*'}  a>> 

where  {P}  = 0]T{F}  is  the  generalized  force  vector  in  modal  coordi- 
nates. The  terms  and  are  the  j-th  generalized  mass  and 
stiffness  defined  by 

The  equations  of  motion,  Eq.  (6)  will  contain  coupling  only  in 
the  nonlinear  terms  if  the  viscous  damping  (Reference  29)  is  propor- 
tional to  inertia,  stiffness,  or  both,  that  is 

+\[KJ  (S) 

where  y and  \ are  proportional ity  constants.  Then,  the  j-th  general- 
ized damping  coefficient  is  given  by 

cr  f/Ticjff''] 

^yuyij  + K Kj  j-),2-,"  ,”  (9) 

Structural  damping  is  another  form  of  damping  that  allows  it  to  be 
uncoupled,  that  is 
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[C]=^[KJ 


CIO) 


where  g(g«l)  is  the  structural  damping  coefficient.  The  j-th 
generalized  damping  coefficient  is 


a i) 


Sometimes  it  is  more  convenient  to  represent  the  damping  as  a 
fraction  of  critical  damping.  The  modal  damping  ratio  represents 
the  fraction  of  critical  damping  in  the  j-th  mode.  This  ratio  is 
related  to  viscous  damping  proportionality  constants  y and  X in 
Eq.  (8),  and  to  structural  damping  coefficient  g in  Eq.  (10)  by  the 
relationships 


(•&  +K<°3 


for  viscous  damping 


, for  structural  damping 


OZ) 


When  damping  can  be  represented  by  proportional  viscous  or 
structural  damping,  then  equations  of  motion,  Eq.  (6),  can  be  written 
as 


fMJfS}  +CKjfe}+[i]TfK%SP][f  m 

-fP}  03) 


The  j-th  row  of  Eq.  (13)  has  the  form 


' M, Zj  *Cj  ij  * Kj  ^ + 1 |, 1,  ir<  K's 


(I4-) 


which  has  coupling  only  in  the  nonlinear  term. 
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3.  EQUIVALENT  LINEARIZATION  APPROACH 

The  basic  idea  of  the  equivalent  linearization  method  (References 
14  and  17)  is  to  replace  the  actural  system,  Eq.  (13)  or  (14),  with 
a set  of  equations  of  the  form 

+ ei Pj 

2=1,2,--;”  US) 


where  K.eq  is  an  equivalent  linear  stiffness  constant,  and  e.  is  the 
J J 

error  of  linearization  or  equation  deficiency  term. 

If  this  error  term  e.  is  neglected,  then  Eq.  (15)  is  linear 

J 

and  it  can  be  readily  solved.  The  smaller  that  the  error  is,  the 
smaller  the  error  in  neglecting  it,  and  the  better  approximate  solu- 
tion to  Eq.  (14)  will  be  obtained.  To  this  end,  the  n equivalent 
linear  stiffness  constants,  K.eq,  are  chosen  in  such  a way  that  the 

J 

2 

mean-square,  E[e.  ],  is  minimized.  The  error  of  linearization  is 

J 


e;  = f^’fr £* 

which  is  the  difference  between  Eq.  (14)  and  Eq.  (15).  From  Eq.  (16) 


it  is  apparent  that  ej  depends  upon  the  equivalent  linear  stiffness 
It  is  these  constants  which  will  vary  in  order  to 


constant  K.eq 
J 

2 

minimize  the  n values  of  E[e.  ] requiring  the  following  equation  for 


i 


\;2, 


n 


c/6) 
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-j ^ k 


07) 


where  the  operator  E[  ] denotes  the  statistical  average  or  mathe- 
matical expectation  of  the  appropriate  variables.  Substituting  Eq. 
(16)  into  Eq.  (17)  and  interchanging  the  order  of  differentiation 


. 


L 


and  expectation,  condition  of  Eq.  (17)  reduces  to 

c/8) 

Solving  for  the  equivalent  linear  stiffness  constant  K®q,  Eq.  (18) 

J 


gives 


*7'  * Flip 


(19) 


Note  that  Eq.  (19)  is  not  an  explicit  equation  for  K?q,  since  the 
expectations  appearing  on  the  right-hand  side  depend  on  K^q.  In 
addition,  the  geometrical  stiffness  matrix  which  is  needed  in  the 
numerator  is  not  known  a priori,  and  the  coupling  of  the  modal 
displacements  causes  some  difficulties  in  evaluating  the  expectation. 
Therefore,  one  has  turn  to  simpler  approach  witha  single-mode 
approximation  solution.  Further  study  is  needed  to  evaluate  Eq. 

(19)  numerically  or  by  some  other  means. 

4.  SINGLE-MODE  APPROACH  AND  MEAN-SQUARE  RESPONSES 

For  most  of  the  sonic  fatigue  analyses  in  practice,  only  a 
single-mode  approach  and  linear  small  deflection  theory  is  commonly 
employed.  The  inclusion  of  the  large  deflection  into  the  analysis 
with  a single-mode  approximation  represents  a significant  improvement 
of  the  design  tools  for  complex  structural  panels. 


21 


— ■ 


Thus,  if  a single-mode  approximation  (usually  the  fundamental 
mode)  is  assumed,  and  also  if  the  excitation  pressure  to  the  structure 
is  stationary,  is  Gaussian,  and  has  a zero  mean.  The  expectations  in 
Eq.  (19)  can  be  evaluated,  and  the  equivalent  linear  stiffness 
constant  becomes 

K**”-  K + 3ft]T[Kf]M  £t£2J 

in  which  the  subscript  j has  been  dropped.  Note  again  that  Eq.  (20) 
is  not  explicit  for  <eq,  since  the  expectation  E[£^j  depends  on  Keq, 
and  also  the  geometrical  stiffness  matrix 

which  is  displacement  dependent  is  not  known  a pridri.  Therefore, 
an  interative  scheme  is  introduced  to  determine  Kec*  from  Eq.  (20). 

This  will  be  presented  later.  At  present,  let  us  assume  that  a 
satisfactory  equivalent  linear  stiffness  constant  has  been  found. 

By  dropping  the  error  of  linearization  e(£)  from  Eq.  (15),  the 
single-mode  approximation  solution  of  Eq.  (3)  is  obtained  from 

mc  + ci  + - P c. li) 

«•  $ +2  5°  5 + «■*•> 

where  = (KecVM)^  is  an  equivalent  linear  frequency,  and  5 is  the 
modal  damping  ratio  related  to  the  linear  frequency  u by  Eq.  (12). 

Now  a random  analysis  of  the  modal  equation  (22)  may  be  easily 
carried  out  to  yield  the  PSD  for  the  modal  amplitude  £ as 

SfW-SpM  I H 


in  which  [Sp(Q)]  is  the  cross  spectral  density  matrix  of  the  noise 
excitation  {F},  and  the  frequency  function  H(ft)  is  given  by 


Hcsv 


/V)  ( co^  - SLZ  + a.  2.  <aDS2-  ) 


The  mean-square  response  of  modal  amplitude  is  related  to  S^(n)  by 

E [£ZJ  = J } 

Jo 

f . 


CZS) 


For  lightly  damped  [c,  <0.05)  structures,  the  response  curves  will  be 
highly  peaked  at  u^.  The  integration  of  Eq.  (25)  can  be  greatly 
simplified  if  Sp(fi)  or  [Sp(fi)]  can  be  considered  to  be  constant  in 
the  frequency  band  surrounding  the  resonance  peak,  so  that 


E[£*J 

*..\-WrL Sr'^Jf+l 


(Z6) 


The  covariance  matrix  of  the  nodal  deflections  (q)  can  be  obtained 
by  use  of  the  coordinate  transformation  given  in  Eq.  (15),  then 


rpj-w  wwf 


jA-i-2'-'"’ 


(27) 
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The  diagonal  elements  of  [q.q.  J are  the  mean-square  values  of  the 
nodal  deflections,  and  the  off-diagonal  terms  are  time  averages  of 
products  of  deflections  at  different  nodes.  The  mean-square  node 
deflection  is  simply 

J!-eW 

= -r-  f<PlT[S/^>]fH 


( 2 8) 


in  which  <j>.  is  the  j-th  element  in  modal  vector  {$}. 

J 

The  element  strains  and  nodal  deflections  are  related  by 

in  which  {e}  is  the  vector  of  element  strains,  [S]^  and  [S^  are 
the  strain-deflection  transformation  matrices.  [S]-j  is  the  usual 
transformation  matrix  based  on  linear  theory,  and  [S]^  represents  the 
inplane  strains  due  to  large  deflections.  Eq.  (29)  is  based  on  an 
appropriate  linearization  of  the  nonlinear  strain-displacement 
relations  (Reference  30)  which  will  be  discussed  in  Section  V.  Thus, 
the  strain  covariance  matrix  is  given  by 

- [sijjjj - tfJi  r&T*jrsj,T 


(Z?) 


(3o) 


L\ 


The  diagonal  elements  are  the  mean-square  values  of  element  strains. 
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5.  INTERATION  PROCEDURE  AND  FLOW  CHART 

As  it  was  pointed  earlier  in  Eq.  (20)  that  the  equivalent  linear 
stiffness  constant  Keq  depends  upon  its  response,  which,  inturn, 
depends  upon  Keq.  And  also,  the  geometrical  stiffness  matrix  [K9] 
is  response  dependent  and  it  is  not  known  a priori.  In  this  way  an 
interative  approach  to  final  solution  occurs.  It  makes  no  difference 
at  which  point  in  the  interation  cycle  the  process  begins.  One 
certainly  can  assume  either  equivalent  linear  stiffness  or  responses 
at  the  outset.  The  process  will  converge  to  an  answer  provided  the 
structure  considered  is  a stable  one.  And  all  panel  structures  of 
aircraft  or  space  vehicle  under  sound  environment  are  examples  of  such 
stable  types. 

Suppose,  for  definiteness,  one  desires  to  estimate  the  initial 
equivalent  linear  stiffness  constant  as  K®9.  The  fact  that  both 
E[C2]  and  [K9]  can  be  approximated  using  the  solutions  of  the  linear 
equations  of  motion  (in  Eq.  (26),  weC]  has  to  be  replaced  by  ou), 
facilitates  the  initial  estimate  of  Keq  through  Eq.  (20)  as 

K?-  K +3f*jTr«*J0|>j  £[£ZJ„  c3i) 


This  calculated  initial  estimate  of  K^eq  can  be  used  to  obtain  refined 
estimate  of  E[C2]-|  and  [K9]^,  that  implies  l^69  through  Eq.  (20)  in  the 
same  way  that  Eq.  (31)  implied  K^eq.  As  the  interative  process 
converges  on  the  j-th  cycle,  the  relation 


-K+3f4>f 

T 


(32) 
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becomes  satisfied.  The  number  of  cycles  required  to  attain  convergence 
depends  on  the  nonlinear  characteristics  of  the  structure  [K^],  the 
intensity  of  the  excitation  [Sp(a.eq ) ] , and  the  accuracy  desired.  The 
solution  procedure  is  illustrated  by  a simplified  flow  chart  shown  in 
Figure  2. 

6.  ESTIMATION  OF  IMPROVEMENT 

It  is  a very  fifficult  task  to  give  an  accurate  quantitative 
estimate  on  the  improvement  that  would  be  made  in  predicting  of 
random  responses  using  the  nonlinear  formulation  without  developing 
the  computer  program  and  analyzing  these  problems  given  in  Section  II. 

A very  crude  estimate  of  root-mean-square  (RMS)  deflections,  however, 
is  possible.  The  ratio  of  RMS  deflection  based  on  large  deflection 
nonlinear  theory  to  RMS  deflection  using  linear  theory  is  given  by 


in  which  <d  is  the  linear  natural  frequency,  and  u)eq  is  the  equivalent 
linear  frequency.  If  the  spectral  density  function  of  excitation 
Sp(ft)  is  a slowly  varying  function  with  respect  to  frequency,  Eq.  (33) 
can  be  further  simplified  to 

C J+) 

Let  us  examine  the  data  obtained  in  Reference  6 (see  Section  II 
(2)),  because  this  was  the  only  report  that  deflections  were  measured. 
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Figure  2.  Simplified  flow  chart  for  complex  panel  nonlinear  response 
to  high  intensity  random  loads. 


For  Skin-Stringer  Panel.  The  measured  RMS  deflection  is  0.064 
inch,  or  twice  the  skin  thickness.  Comparison  showed  that  calculated 
panel  deflection  based  on  linear  theory  is  34-percent  high.  The 
rectangular  panel  has  a length- to-width  ratio  of  3.7.  It  was  assumed 
to  be  clamped  along  the  edges  in  the  analysis.  The  ratio  of  linear 
frequency  to  equivalent  linear  frequency  needed  in  Eq.  (33)  or  Eq.  (34) 
can  be  estimated  from  the  following  data: 


Support 

Length- 

U) 

Condition 

Width  Ratio 

Reference 

aieq 

Clamped 

1.0 

31 

0.64 

Clamped 

2.0 

30 

0.58  (extrapolated) 

s 

•l_J* 

c 

3.7 

32 

0.53 

c 

•L_> 

3.7 

32 

0.65 

c 

With  an  approximate  ratio  of  (w/oj^)  s 0.6,  Eq.  (33) becomes 

(RMS  deflec-h'on)NL  ~ 

(RMS  oleflecjian)L  J ^Spfco) 

The  RMS  deflection  based  on  large  deflection  theory  would  be  very 
close  to  the  measured  value. 

Without  analyzing  the  problem  with  computer  program,  it  is  very 
difficult  to  give  a quantitative  estimate  on  the  improvement  in  pre- 
dicting response  strains  using  nonlinear  theory.  The  difficulties 
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come  from  the  transformation  matrices  [S]^  and  [S]2  in  Eq.  (30). 

Therefore,  no  attempt  is  given  for  pursuing  it  further. 

■ 
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V.  GEOMETRICAL  STIFFNESS  MATRICES 


The  geometrical  stiffness  matrix  [k^({6})]  needed  in  the  formula- 
tion has  been  developed  for  various  types  of  elements.  They  have  been 
used  successfully,  in  conjunction  with  the  consistent  mass  and 
stiffness  matrices,  in  problems  of  large  amplitude  vibrations  of  com- 
plex structures.  A brief  review  of  the  advances  in  the  development 
of  geometrical  stiffness  matrices  and  the  associated  large  amplitude 
vibration  problems  is  given  in  this  section. 

Extension  of  the  finite  element  method  to  large  amplitude 
vibrations  of  beams  and  rectangular  plates  was  first  reported  by 
Mei  (References  33  and  34).  The  geometrical  stiffness  matrix  formula- 
tion for  a rectangular  plate  element  in  Reference  34  was  based  on  a 
modified  form  of  the  Berger's  hypothesis  (Reference  35).  Nonlinear 
frequencies,  which  were  obtained  for  rectangular  plates  with  various 
edge  support  conditions,  agreed  well  with  the  approximate  analytical 
solutions.  Results  for  some  boundary  conditions  in  Reference  34  were 
obtained  for  the  first  time. 

One  important  thing  which  has  to  be  mentioned  at  this  moment  is 
that  the  nonlinear  frequencies  determined  from  the  large  amplitude 
vibrational  analysis  using  a "Quasi-Linearization"  technique  have  the 
very  same  physical  meaning,  comparing  Eq.  (14)  with  Eqs.  (21)  and  (22), 
as  the  equivalent  linear  frequency,  u^,  using  equivalent  linearization 
method.  Therefore,  to  gain  a better  understanding  of  large  amplitude 
vibrations  of  plate  and  shell  structures  will  certainly  be  helpful  to 
build  a more  solid  background  for  problems  of  random  vibrations  of 
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complex  nonlinear  structures.  This  has  been  learned  to  be  true  in  the 
linear  case. 

Recently,  Rao  and  his  colleagues  presented  a novel  simplified 
formulation  for  large  amplitude  vibrations  of  beams  (Reference  36), 
rectangular  plates  (References  30  and  37),  and  circular  plates 
(Reference  38).  Their  formulation  is  based  on  an  appropriate 
linearization  of  the  nonlinear  strain-displacement  relations  and  an 
interative  scheme  of  Mei's  (Reference  33)  to  obtain  the  nonlinear 
frequencies.  This  nova!  linearization  technique  was  used  in  the 
strain-deflection  relations  of  Eq.  (29).  Reddy  and  Stricklin 
(Reference  39)  developed  a linear  and  a quadratic  isoparametric 
rectangular  element  to  study  large  amplitude  plate  vibrations.  Most 
recently,  two  triangular  element  formulations  have  been  developed  for 
large  amplitude  vibrations  of  thin  plates  of  arbitrary  planform.  The 
first  one  (Reference  40)  is  consistent  with  the  higher  order  bending 
element  TRPLT1  (Reference  41)  in  NASTRAN  program,  and  the  second 
(Reference  41)  is  consistent  with  the  high  precision  plate  element  of 
Cowper  et  al  (Reference  43).  Nonlinear  frequencies  obtained  for 
numerical  examples  include  rectangular,  circular,  rhombic,  and  isosceles 
triangular  plates. 

Raju  and  Rao  (References  44  and  45)  intended  to  develop  a shell 
of  revolution  frustum  for  nonlinear  vibration  analysis  of  thin  shells 
of  revolution;  however,  their  results  failed  to  predict  the  "softening" 
type  of  nonlinear  behavior  as  discussed  by  Evensen  (Reference  46). 

Geometrical  stiffness  matrices  of  various  types  of  finite  elements 
that  have  been  developed  for  free  vibration  analysis  involving  large 


deflection  nonlinearities  are  listed  in  Table  5. 


TABLE  5 - GEOMETRICAL  STIFFNESS  MATRICES 
OF  VARIOUS  FINITE  ELEMENTS 


Element  Type 

Beam 

Rectangular  Plate 

Rectangular  (orthotropic)  Plate 
Circular  Ring  (orthotropic)  Plate 
Triangular  Plate 

Shell  of  Revolution 


Reference 

Mei33  (1972) 

Rao  et  al.36  (1976) 

34 

Mei  (1973) 

Rao  et  al.30  (1976) 

Reddy  and  Stricklin38  (1977) 

Rao  et  al.37  (1976) 

Rao  et  al.38  (1976) 

Mei  and  Rogers40  (1977) 

4? 

Mei  et  al . (1978) 

Raju  and  Rao44,45  (1975,-76) 
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VI.  QUASI-LINEARIZATION  METHOD 


The  same  physical  interpretation  between  the  equivalent  linear 
frequency  and  the  nonlinear  frequency,  Eqs.  (14),  (21)  and  (22),  leads 
to  the  idea  that  the  method  of  quasi-linearization  may  also  be  applied 
to  problems  of  complex  nonlinear  structures  subjected  to  random 
loads.  The  quasi -1 inearization  technique  has  been  used  successfully 
in  predicting  nonlinear  frequencies  for  complex  structures  (References 
33,  34,  36-40,  42,  44,  45).  Nonlinear  frequencies  of  higher  modes 
can  also  be  determined  by  this  technique  (References  30,  33,  40,  42). 
Those  nonlinear  frequencies,  j,  can  be  employed  to  obtain  an 
approximate  random  response  of  deflections  from  the  relation 

‘r"  1 1 

r-Js-L2*—*'n 

This  method  may  very  well  be  more  promising  than  the  equivalent 
linearization  technique.  However,  both  methods  are  worth  pursuing 
further. 
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VII.  CONCLUSIONS 


1.  SUMMARY  OF  RESULTS 

The  negligen^of  "large  amplitude  nonlinearity"  in  the  analysis 
was  identified  to  be  the  major  factor  that  contributed  to  the  enormous 
discrepancy  between  the  measured  test  data  and  computed  results.  A 
mathematical  formulation  which  is  based  on  the  finite  element  displace- 
ment method  and  the  equivalent  linearization  technique  is  developed. 
Statistical  responses  of  deflection  and  strain  using  a single-mode 
approximation  can  be  expressed  in  terms  of  linear  frequency,  spectral 
density  matrix  of  excitation,  equivalent  linear  frequency,  and  trans- 
formation matrices.  An  interative  scheme  which  is  used  for  determining 
the  equivalent  linear  stiffness  constant  is  presented.  Advances  in  the 
development  of  geometrical  stiffness  matrices  for  various  finite  element 
is  given.  Finally,  a concept  of  applying  the  quasi-linearization 
method  to  problems  of  nonlinear  complex  panel  structures  subjected  to 
high  intensity  noise  levels  is  discussed  briefly. 

2.  RECOMMENDATIONS 

a.  Development  of  Computer  Program.  A computer  program  should 
be  developed  based  on  the  finite  element-equivalent  linearization 
formulation  presented.  The  computed  results  should  be  compared  with 
the  experiments. 

b.  Experiments.  Carefully  monitored  and  controlled  experiments 
with  simple  structures  (such  as  plates)  and  typical  aircraft  panels 
should  be  conducted  at  both  low  and  high  noise  environment.  Measurements 
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of  deflection,  strain,  frequency,  and  pressure  spectral  density 
should  be  precisely  recorded. 

c.  Refinement  of  Finite  Elements.  Refined  finite  element 
representations  should  be  developed  and  incorporated  into  the  computer 
program,  such  refinements,  for  example,  as  higher-order  displacement 
function  for  beam  element  with  various  thin-walled  cross-sections, 
variation  in  thickness  for  plate  element,  shallow  shell  element, 
anisotropic  properties  for  modeling  composite  panels,  etc. 

d.  Multi-Mode  Approach.  The  improvement  in  response  predictions 
using  multiple-modes  as  to  single-mode  based  on  linear  theory  (low 
noise  levels)  should  be  investigated.  Multi -modes  must  be  employed 
in  nonlinear  analysis  (high  noise  levels)  if  the  improvement  were 
found  to  be  significant  in  the  linear  case. 

e.  Quasi-Linearization  Method.  More  research  study  on  this  very 
promising  approach  should  be  conducted. 
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Abstract 

Given  adequate  open-loop  specifications,  for  example,  aircraft  hand- 
ling qualities  criteria,  design  techniques  are  available  to  the  system 
designer  for  synthesizing  even  the  most  complex  flight  control  system. 
Unfortunately,  however,  weaknesses  exist  in  the  handling  qualities 
specification,  particularly  for  "non-conventional"  vehicles  such  as 
control  configured  vehicles  (CCVs).  In  this  investigation,  an  aug- 
mentation method  based  on  optimal  control  techniques  and  closed-loop, 
task-oriented  criteria  was  applied  tb  augment  the  system  dynamics  for 
a man- in-the- loop  air-to-air  tracking  task.  The  cases  of  tracking 
during  level  flight  (longitudinal  tracking)  as  well  as  tracking  during 
a high-banked  maneuver  was  addressed.  The  predicted  tracking  performance 
with  and  without  augmentation  is  compared  to  man-in-the-loop  simulation 
results  and  the  improved  performance  was  significant. 
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I.  Summary  and  Conclusions 


I. 1 Summary 

Given  adequate  open-loop  specifications,  for  example,  aircraft  handling 
qualities  criteria,  design  techniques,  particularly  modem  control  approaches, 
are  available  to  the  system  designer  for  synthesizing  even  the  most  complex 
flight  control  systems.  Unfortunately,  however,  weaknesses  exist  in  the 
handling  qualities  areas,  particularly  for  "non-conventional"  aircraft  such 
as  control  configured  vehicles  (CCV's).  In  this  report,  an  augmentation 
synthesis  method  usable  in  the  absence  of  quantitative  handling  qualities 
specifications  will  be  presented,  and  the  application  of  this  method  to 
augment  the  system  dynamics  for  a man-in-the-loop  air-to-air  tracking  task  will 
be  reported. 

In  the  absence  of  open-loop  specifications,  the  alternative  is  to  design 
the  augmentation  via  closed-loop  performance  criteria.  With  the  use  of  an 
analytical  pilot  model,  the  augmentation  synthesis  and  performance  prediction 
will  be  obtained.  However,  in  the  type  of  problem  being  considered  here  in 
general,  the  selection  of  the  type  pilot  model  is  important.  Due  to  the 
predictive  nature  of  models  developed  from  optimal  control  theory  and  the 
applicability  of  optimal  control  theory  to  multi -dimensional , high-order 
system,  this  pilot  modeling  approach  was  adopted. 

In  the  next  section  of  the  report,  optimal -control  pilot  modeling  is 
highlighted’,  and  an  augmentation  method  previously  developed  by  the  author 
(Ref.  1)  is  presented.  The  method  involves  simultaneously  solving  for  the 
pilot  model  and  augmentation.  Simultaneous  solution  is  important,  and  is 
required  since  the  pilot  is  known  to  vary  his  equalization  and  gain  subject 
to  the  plant  dynamics,  which  is  being  determined  with  the  augmentation  process. 

In  Section  III,  the  method  is  applied  to  a longitudinal  tracking  task, 
and  several  cases  are  investigated.  Two  ranges  and  two  rms  target  acceleration 
levels  are  considered,  with  a lead-computing-optical  sight  (LOOS)  as  well  as 
a perfect  director  sight.  Initially,  the  predicted  performance  (for  both 
sights)  is  compared  with  previously  obtained  experimental  data,  to  establish 
pilot  model  credibility.  Then  a family  of  augmentation  systems  are  developed 
for  all  cases  cited  above.  With  the  full-state  feedback  systems  so  designed, 
significant  tracking  performance  improvements  are  obtained.  Additionally, 
the  system  dynamics  are  appreciably  modified. 

Finally,  as  an  application  of  the  pilot  model  to  an  even  larger  system, 
with  multiple  control  imputs,  the  predicted  piloted  tracking  performance 
for  an  F-106  vehicle  in  a level,  four-g  turning  condition  is  compared  with 
simulated  results  in  Section  IV.  The  simulation  was  recently  performed  in 
AFFDL's  large  amplitude  motion  simulator  (LAMARS)  facility.  The  results 
compared  most  favorably  - a very  important  result  considering  the  pilot  model 
was  a priori  "tuned"  with  very  nominal  values  rather  than  adjusted  to  fit 
the  data. 


2 


A significant  portion  of  this  overall  effort  involved  develoDina  the 

lead^nal^^f^n0?^  for  *he  en9a9ement  geometry,  line  of  sight,  sight 
les»  target  kinematics,  and  vehicle  model  at  a large  bank-anqle 

A thrJigh*D.The  reSUltS  of  this  important  effort  are  reported  in  Applndices 
1*2  Conclusions 

«=-im,.T=S^IaVOra?le  con1Parisons  between  predicted  theoretical  performance  and 

intent^?  ulfs .?as  TOSt  gratifying,  especially  in  the  F-106  case.  The 
intent  of  this  latter  case  was  to  attempt  to  lend  further  credibility  to  the 

arrnlr dh  ]ng  5)[;th°ds  3S  3 whole*  and  1F1  the  author's  opinion  this  was 

S the  lack  ^model  ®xtreme1y  will»  as  ment?oned  above,  consid- 

® ,ck  °f  model  adjustment  necessary.  Also,  only  one  simulator  run  of 

stateXrmrDrpHSKeCOndST^ra^i0n  W3S  obtained  to  compare  with  the  steady 
database  predlct70ns*  Thls  1s  not  therefore,  representative  of  a large*^ 

^.,nn«L^U9nieI!t3t!?n  re^u1ts  were  most  promising.  The  conputer  algorithm 

equations  "anw  Yhl 1_w1Jhout  fla?  ln  the  solution  of  the  coupled  RiOcati 
equations,  and  the  performance  improvements  were  significant  However  to 

proceed  to  a practical  implementation  of  the  methodology,  the  system oer- 

n~bre 5.?-.f£i‘st?£.“r"top’  as.wen  as  "Uh  state™ feedback 

S!,-?  ft  1 These  two  approaches  are  clearly  practically  realiz- 

thl'flS  nep  In  thi^syn thesis  a'pTroX0*’™**10"5  ^ deVel°Ped  C°"stitute 


IX.  The  Augmentation  Methodology 


In  the  absence 
handling  qualities) 
to  be  taken  here  wi 
cost  function  based 
vehicle  is  manually 
explicitly  included 
system  aspects  must 
Ref.  1 . 


of  quantitative  dynamic  specifications  (i.e., 
for  the  vehicle  in  a specific  task,  the  approach 
11  be  to  augment  the  vehicle  such  that  an  objective 
on  the  task  is  minimized.  In  addition,  since  the 
controlled,  the  man-in-the-loop  aspects  must  be 
. Finally,  the  multi-dimensional,  high-order 
be  considered.  The  approach  will  follow  that  of 


II. 1 The  Pilot  Model 

To  include  the  man  in  the  loop,  an  analytical  model  of  the 
human  controller  is  required,  and  the  modeling  approach  to  be  followed 
here  is  to  use  the  optimal  control  representation  of  the  man  rather 
than  a describing  function  pilot  model.  The  major  drawback  of  a 
describing  function  model  is  that  the  pilot's  gain  and  equalization  is 
known  to  depend  on  the  plant  he  is  controlling  (i.e.,  he  is  adaptive), 
and  the  plant  dynamics  are  to  be  augmented,  hence,  are  not  known 
a priori.  Furthermore,  the  optimal  control  model  (OCM)  is  compatible 
with  optimal  control  systhesis  methods,  well  suited  for  multi -dimension- 
al systems. 

As  presented  in  Reference  2 , the  optimal  pilot  model  evolves 
from  the  assumption  that  the  well-trained,  well-motivated  pilot  selects 
his  control  input (s),  u , subject  to  human  limitations,  such  that  the 
following  objective  is  Minimized, 

T 

jp  = e {lim  y / (y  'Qy  + ^pRup  + ^p 


The  dynamic  system  being  controlled  by  the  pilot  is  described  by  the 
familiar  linear  relation 

x = A x + B u + w 
P P P 

y = cx  (A) 

where  x is  the  system  state  vector,  u^  the  pilot  control  vector,  y the 

output  vector,  and  w is  the  vector  of  zero-mean  external  disturbances 
with  covariance 

E[w(t)w'  (t+a)]  = W6 (a) 

Included  as  human  limitations  are  observation  delay,  t,  and  observation 
noise,  v^,.  So  the  pilot  actually  perceives  the  noise-contaminated. 


delayed  states,  or 

y = C x(t  -t)  + V ft-  T) 

'P  P y 

The  covariances  of  the  zero-mean  observation  noise  may  include  the 
effects  of  perception  thresholds  and  attention  allocation,  and  is  denoted 


E[Vy  (t)Vp  (t+cr)  ] = Vy  6 (o) 


Defining  the  augmented  state  vector,  ot  = col  [x,  u ],  the  solution  to  the 
problem,  or  the  pilot's  control  is  given  as  P 


up*  = -G_1[0;  I]Kp* 


where  Kp  is  the  positive  definite  solution  to  the  Riccati  equation 
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It  will  be  convenient  to  partition  Kp  such  that 


K = 
P 


K I K 1 

Pi  I^P2 

I K"  “ 


P3  I P4 


I 


and  note  that  now  the  equation  for  the  optimal  control  up*  is 


(B) 


u 


* = -G_1K 


x - G 


K u* 
P4  P 


A 

or  a linear  feedback  of  the  best  estimate  of  the  state,  x,  and  some 
control  dynamics.  (These  control  dynamics  have  been  shown  to  be  equiv- 
alent to  the  pilot's  neuro-muscular  lag.) 

To  model  the  pilot's  remnant,  motor  noise  is  usually  added  to  the 
control  equation.  The  final  pilot's  control  is  represented  by 


u*  = -G' 
P 


lK  x - G 

P3 


u 


P4  P 


* + G_1K  v 


m 


where 


E[v  ft)v'  (t+cr)]  = VM  5(d) 
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Now,  the  state  estimator  consists  of  a Kalman  filter  and  a least- 
mean  square  predictor,  or 

x(t-t)  = Apx(t-r)+  zcp'V  _1[Y  (t)  -CpX  (t'T)]  + Bpup(t‘T) 


with 


x(t) 


P (t)  + e-V  U(t-x)  -P  Ct-T)] 


P = A P ♦ B u* 

P P P 

and  the  estimation  error  covariance  matrix  E is  the  solution  of  the 
Riccati  equation 

A E+EA'  + W-EC  'V-1C  E = fOl 
P p p y p 1 J 

This  system  of  equations,  when  solved,  determines  the  optimal -control 
pilot  model.  The  actual  numerical  solutions  were  obtained  with  the 
program  PIREP,  documented  in  Reference  3 . 

II. 2 Augmentation  Synthesis  Method 

In  the  determination  of  the  pilot  model  parameters  above,  we  have 
expressed  the  system  dynamics  in  terms  of  the  matrices  A and  B . How- 
ever, since  the  augmentation  has  not  been  defined,  the  augmented  plant, 

Ap  and  Bp  is  as  yet  unknown. 

Consider  the  un-augmented  plant  dynamics  to  be  described  by 

x = Ax  + Bu  + w 

where,  as  before,  x is  the  system  state  vector  and  w is  the  same  distur- 
bance vector.  However,  A and  B are  now  the  un-augmented  system  matrices, 
and  u is  the  control  input  vector.  Now  the  total  control  input  to  the 
plant  will  include  pilot  input,  up,  plus  augmentation  input,  Ug^g>  or 

u = u + u-,c 
p p SAS 

Further,  from  the .pi lot  model,  we  know  that  although  the  feedback  gains 
(e.g.,  G_IK  , G-1K  ) have  not  been  determined,  the  pilot's  control  input 


is  expressible  as 


- -1  - -1  - 
u = -G  K x - G K .u 
P3  P4  P 


(C) 


Now,  the  estimate  of  the  state,  x,  can  be  expressed  in  terms  of  the  true 
state  plus  some  estimation  error, e,  or 


x = x + e 


By  treating  this  error  as  another  disturbance,  Wp,  we  can  write  the  pilot's 
equation  as 

u = -G" 1K_  i - G-1K  U+ 


P3 


P4  ? 
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(Note,  the  disturbance  term,  w',  can  also  include  the  pilot's  remnant 
as  well.)  Combining  this  relation  with  the  plant  dynamic  and  pilot 
equations  we  have 


where 


- _ 
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X 
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x = col [x,  up] 


J (D) 


Wemay  now  proceed  to  determine  an  objective  function  for  deter- 
mining  u^. 

Since  Hess  (Ref.  4 ) has  shown  a strong  correlation  between  pilot 
rating  and  the  pilot's  objective  function  we  clearly  see  that  the  best 
(i.e* , lowest)  pilot  rating  implies  the  lowest  pilot  objective  function. 

Therefore,  for  optimum  pilot  rating  for  a task,  the  control  u„,„  should 
, , . SAS 

be  chosen  to  minimize  J as  defined  in  the  pilot  rating  method.  (This 
method  defines  the  state^and  control  weights,  Q and  R,  as  the  inverse 
of  the  maximum  allowable  deviations  in  the  variables  as  perceived  by 
the  pilot.)  Finally,  to  preclude  infinite  augmentation  gains,  we  must 
also  penalize  augmentation  control  energy.  Therefore,  the  augmentation 
is  chosen  to  minimize 

1 
J. 


Jp  + E (lira  T / u^AS  F u$A$  dt } 


SAS 


T-x» 


or 


JSAS  ■ E T l0  * 5pRip  * * “sAS^AS^' 


and  Q,  R,  and  G are  as  chosen  in  the  pilot's  objective  function,  J 
This  may  be  written  as  P* 


1 T 

JSAS  = E ll1rn  T / (x'px  + Ur 
T-*®  0 J 


SASFuSAS^dt} 


where 


CpOCp  ♦ V G'V  ; Kp  G’V  - 

3 P3 1 P3  P4 


k:  g'Y 

P4  P3 


■t 


! r+k:  g’Y 


and  instead  of  Equation  3 being  substituted  for  u in  the  above  J 
we  have  invoked  a sort  of  separation  principle  anPd  substituted  t he^' 


relation 


un  = -G_1 Kp  x - G_1K  u 
P P 


The  justification  for  this  relation  being  used  lies  in  the  fact  that 
we  wish  to  synthesize  the  augmentation  based  on  how  the  pilot  is  trying 
to  perform  the  control  function  rather  than  on  how  the  pilot  is  capable 
of  doing  so. 

With  this  objective  function  and  the  system  dynamics  given  in 
Equation  D,  the  problem  is  now  stated  in  conventional  form,  except 
K and  K are  as  yet  undetermined  of  course.  If  we  assume,  for 
P3  P4 

example,  full  state  feedback,  the  solution  of  this  problem  is  known 
to  be 


SAS 


-F'1  [B*  J 0]  K, 


SAS* 


or 


u 


SAS 


= -F_1B' 


F B’KSAS  x ‘ F B’K—  u- 


i 


is  the  solution  to  the  Riccati  equation 


SAS2  p 


where 

r v 

Iv 

- 

KSAS  = 

"SASj  | SAS2 

1 

— 

KSAS3 

i sas4 

1 

3 | 
I 


I 

I B 


-G_1Kp 


r 


V . Y 

SAS  SAS 


-g'V 


-G_1Kp 


-P  + K, 


SAS 


rB 

0 


F_1  I6'  ! °]KSAS  - ^SAS 


(E) 
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We  see  in  this  expression  that  the  solution  for  KgAS  obviously 

deoends  on  K for  K and  K ).  Returning  to  the  Riccati  equation  foT 
P P3  P4 

the  pilot  gain  (Equation  g) » we  als0  see  t^at  that  equation  depends  in 
turn  on  the  SAS  gains  (or  KgAS)  since  the  pilot  Riccati  equation  involves 


the  augmented  plant  matrices  Ap  and  Bp. 


As  a result  of  the  SAS  design 


procedure  just  presented,  we  now  know,  however,  the  SAS  structure. 
Returning  to  the  pilot  model,  we  may  now  include  this  SAS  structure 
specifically,  so  that  A and  B (as  in  Equation  A)  may  in  fact  be 
expressed  as  p 

A = A - BF-1B' Kc.c 


B = B Cl  -F-1B'K, 
P 5 


Substituting  these  expressions  in  the  pilot  Riccati  equation  yields 
two  coupled  Riccati  equations,  one  for  the  pilot  gains.  Equation  B,  and 
one  for  the  SAS  gains.  Equation  E.  These  may  be  solved  simultaneously 
for  KgAg  and  K . The  equations  to  be  integrated  are  obtained  by  mul- 
tiplying out  tRe  submatrices  in  Equations  B and  E.  Then  the  elements 


of  the  submatrices  K 


S>AS  ’ etc‘  are  t^len  f°un<i  by  integrating  the 


resulting  8 matrix  equations  backwards  in  time. 

Now  the  pilot  model  is  used  to  evaluate  augmented  system  performance 
with  the  plant  matrices  (A^  and  B^)  found  from  Eqn.  F. 


Ill  Application  to  Longitudinal  Tracking - 
LCOS  and  Director 


The  initial  application  of  the  methodology  involved  the  aug- 
mentation of  the  tracking  task  in  the  longitudinal  axis  only.  This 
piloting  task  was  first  investigated  with  the  optimal  pilot  model  by 
Harvey  and  Dillow  (Ref.  7 )•  In  the  cited  investigation  the  dynamic 
model  included  the  longitudinal,  short -period  approximation  for  four 
different  aircraft  and  a lead-computing  optical  sight  (LCOS) . The  task 
was  experimentally  investigated  in  the  laboratory  using  an  analog  simu- 
lator equipped  with  oscilloscope  (display)  and  force-sensitive  stick 
control.  The  optimal  pilot  model  was  then  applied  to  "predict"  the 
manual  control  performance.  (Actually,  in  this  case  the  experimental 
data  was  used  to  tune  some  of  the  pilot  model  parameters  and  compare 
predicted  performance.) 

IIU  PIREP  VALIDATION 

To  assure  the  validity  of  the  PIREP  program  a set  of  cases  consid- 
ered by  Harvey  were  evaluated  with  the  PIREP code.  The  math  model  develop- 
ment for  this  case  is  as  follows. 


The  E-4E  aircraft  dynamics  at  an  altitude  of  15,000  ft.  and  a MACH 
number  of  0.9  (U#  = 952  fps)  were  used.  The  stability  derivatives  in 
this  case  are 

-1.033  U*  (ft/sec2)  M*  = -.344  (rad/sec) 

"Z$  = -.0951  ua  (ft/sec2)  = -.738  (rad/sec) 

10.44  (rad/sec2)  M =-37.1  (rad/sec2) 

In  addition,  the  elevator  dynamics  were  modeled  in  this  case  with 
the  relation 

S£  s ~z°-  y/«* 

With  the  above  parameters,  the  longitudinal  equations  for  the  short 
period  approximation  are  for  this  case 


<$_  = -206-+16.  Sstick 
E E 
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The  trim  condition  is,  of  course,  level  flight  (^=^L=P1=Q1=R1=V1=W1=0) 
^=10°  (Ref.  8 )• 

As  can  be  verified  from  Equations  H in  Appendix  B,  the  sight 
equation  for  the  perturbation  elevation  lead  angle,  Kf/_ , is 

where  the  steady  state  lead  angle  and  line  of  sight  are  assumed  zero, 
and  the  attacker’s  normal  acceleration,  az,  is  approximately  equal  to 
Z Furthermore,  in  this  case  the  attackers  and  targets  steady-state 
velocities  (e.g.,  Uj,  Vj)  and  specific  accelerations  (A^m,  A^,  etc.) 
are  assumed  equal.  AThe  ballistic  coefficient,  Jv,  was  estimated  from 
a relation  given  in  Harvey  (Ref.  7 ) as 

Jv  = .0382 

(Note  that  the  relation  in  Harvey  Ref.  7 does  not  agree  with  the 
definition  of  Jv  given  in  Ref.  5 •)  Finally,  the  time  of  flight, 

, was  estimated  with  the  method  given  in  Ref.  using  a muzzle 
velocity,  Vj,,  of  3400  ft/sec.  Since  two  ranges  (0=  1000ft  and  3000  ft) 
were  considered,  two  times  of  flight  were  estimated.  They  were  T~=.327 
sec(Vf=2110ft/sec)  for  D=1000ft,  and  T£=1.30  sec(V£=1350ft/sec)  for 
D= 3000ft. 

The  relation  used  by  Harvey  for  the  pipper  error, e , is 
e = X -Ba 


as  shown  in  the  sketch,  where  8^  is  the  relative  line-of-sight  angle. 

.weapon  line 


Display  Sketch 

It's  dynamics  were  described  by  the  relation 


where  Bj 


*a  " 6 - »i 

is  the  inertial  line  of 


sight  rate,  and 

) 


In  the  above  relation,}^  is  the  target  flight  path  angle 

and  a is  the  target's  normal  acceleration  described  by  the 
ZT 


Markov  process 


V 


/ 


- ar  * n> 


The  target  time  constant  chosen  was  3 seconds  and  O'  was 
selected  as  3.5  and  5.0  g's.  Since  (Ref.  ) i 


f * 


t 


selection  of  (T  specifies  the  noise  statistics.  The  math  model 
®T 

for  four  different  conditions  is  now  complete;  two  distances,  1000 
and  3000  ft,  and  two  target  acceleration  levels,  3.5  and  5.0  g's 
rms . 

The  pilot  model  parameters  used  are  given  in  Table  1 and  the 
rms  performance  comparisons  are  given  in  Table  2.  As  a result  of 
the  close  agreement,  PIREP  was  considered  satisfactory.  A sample 
output  is  given  in  Appendix  E . 


Table  1 

Pilot  Model  Parameters 


Observation  delay,  V = o.2  sec. 

Neuromuscular  time  constant,  .1  sec. 

Observation  vector,  Y'»  [e,e,A,A] 

Cost  function  weights,  Qy^  = [16.,  1.,  0.,  4.] 

Qu  = 0. 

— 2 

Observation  noise,  Vy^  = irpwy  ;p  = .01 

2 

Motor  noise,  VM  = irpOJj  ; 0=  .0015  (Harvey) 

^)  = .001  (Schmidt) 

Observation  thresholds,  Th  = Th,  = 0.63° 

Th  =Th,=0.65° 

C A 

Full  Attentional  Allocation 
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Table  2 

Performance  Comparison  - LCOSS 


Pipper 
Error  1 

RMS  Parameter 

Lead 

£ U*t)  Angle, 

Pitch 

Rate,  q (deg/ sec) 

. J M 

Elevator 

Se  (deg  ' i 

1000  ft  gy=  3.5  g 

• -*  1 

Experiment 

2.1 

2.9 

8.0 

2.1 

Harvey 

2.3 

2.9 

8.8 

. i ] 

2.1 

Schmidt 

2.3 

2.9 

8.7 

2. 1 

iooo  ft  <rT=  5 g 

) 1 

Experiment 

2.7 

4.0 

11.2 

3.0 

Harvey 

3.0 

4.2 

12.4 

3.0 

Schmidt 

i 3.0 

4.2 

12.1 

) I 

3.0 

3000  ft.,  (Tt=  3. 5g| 

/ s 

Experiment 

2.4 

8.7 

6.4 

1.8 

Harvey  1 

2.4 

9.2 

7.6 

1.9 

Schmidt 

2.3 

10.1 

7.3 

1.8 

3000  ft,  ay  5 g 

I j 

Experiment 

2.7 

12.1 

9.2 

2.4 

Harvey 

i 

2.9 

13.0 

10.5 

2.6 

Schmidt 

2.7 

14.2 

10.0 

2.5 

■T 1 1-2  Augmentation  of  Tracking  with  LCOS 


As  shown  in  Section  II  , the  augmentation,  assuming  full- 
state  feedback,  is  determined  from  the  relation 


or 


USAS  * - f'b  X - F 'S‘Ksv7  “p 
U fjif  s ~ Ajlr  A ~ ^ ' < *i£ 


r4  ° stick 

where  the  system  dynamics  are  described  by 

j?  = AY  + 


C. 


In  the  longitudinal  tracking  case  previously  discussed,  the 
state  vector  may  be  chosen  as 


x'* 


artj  *V  >px , A , <x,  O,  g* , Se] 


and  the  pilot's  control  is  u = Jstick.  After  selection  of  the  SAS 
weighting,  F,  the  SAS  and  pilot's  Riccati  equations  are  solved  simul- 
taneously, as  noted  in  Section  II,  to  determine  K 


SAS, 


and  K, 


SAS- 


The  gains  K and  Kp  in  Eqn.  C are,  of  course,  now  available. 
x °s 

For  the  LCOS  tracking  cases  presented  in  the  previous  section, 
a family  of  gains  (Kx  and  K<f)  are  given  in  Table  3.  Note  that  ^ince 
the  two  tracking  destances  are  considered,  we  have  two  sets  of  system 
dynamics,  hence  two  sets  of  gains. 

The  augmented  plant  matrices  are 


4,-  A-Sk i 
rU-K,) 

The  eigenvalues  of  the  plant  matrix  Ap 


are  shown  in  Figures  1 and  2. 


1 3 ) -H.CiHU  j 


l COS  '/COO Ft 


It'//  JS-£/-(f) 


doom's 007 
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The  predicted  performance  (pipper  error  and  elevator  activity) 
of  the  piloted  system  is  depicted  in  Figure  3 as  a function  of  aug- 
mentation level  for  both  a 3.5g  and  5g  rms  target  acceleration. 
Als^  shown  for  reference  is  the  unaugmented  system  performance 
(F~  =0.).  Clearly,  with  increased  augmentation  level,  tracking 

accuracy  is  reduced  while  total  elevator  activity  remains  almost 
constant. 

This  last  result  was  somewhat  surprising  until  an  analysis  of 
the  various  components  of  commanded  elevator  was  performed.  Recall 
that  the  SAS  control  input  is 


<T stick 
**  pilot 


where  UgAg,  analogous 
stick  input.  Therfore. 


to  the  pilot's  control,  is  an  equivalent 
the  commanded  elevator  deflection,  , 


^ £ti 


stick 


is 


where  Kg  is  a gearing  ratio  (.8  in  this  case).  Now  wt  write 

Since  our  results  showed  positive  values  for  ,K£  we  see  that  the 
SAS  is  cancelling  some  of  the  pilot's  commanded  elevator. 

A plot  of  the  total  elevator  deflection,  pilot  effective  com- 
manded elevator  deflection  fl-Kr ) K_  £ and  effective  SAS  eleva- 

* 9 ST 

p 

tor  deflection  -K  K^x  is  shown  in  Figure  4.  And  the  total  pilot's 
stick  activity, j is  shown  in  Figure  S for  the  1000ft.,  5g  case. 
P 


As  can  be  seen,  the  augmentation  input  is  simply  replacing  the 
pilot's  input.  And  since  in  this  analysis  full-state  feedback  and 
no  augmentation  sensor  errors  are  assumed,  the  SAS  will  always  be 
more  precise  than  the  pilot.  A first  estimate  of  the  optimal  gains 
would  be  the  case  for  F'1  = .01. 


I I 13  Tracking  and  Augmentation  with  Perfect  Director 

As  discussed  in  Appendix  B as  well  as  Ref.  S' , a perfect  director 
sight  is  based  on  the  assumption  of  perfect  knowledge  of  the  tar- 
get's acceleration  and  the  line-of-sight  rates,  rather  than  using 
the  attacker's  parameters  to  estimate  these  values  as  in  LCOS.  The 
effect  is  a sight  with  much  less  dynamics.  To  investigate  the 
effect  of  a different  sight,  the  analysis  was  also  performed  with  a 
director. 


I 


i 


In  this  case,  the  equation  for  the  lead  angle  (Eqn.  B)  is 
replaced  by  the  relation 

T*  P* ' sir*  * J*  % *■ 

As  a result,  the  state  vector  becomes  r 

[*’,  «tc  , vT/  pr,  *,  e,p 

and  the  output  vector  is  still 

with  c appropriately  defined  from  the  state  equations  previously 
presented.  The  same  pilot  model  parameters  as  given  previously  in 
Table  1 were  used  as  well.  A comparison  of  rms  performance  between 
the  LCOS  (as  in  Table  2)  and  a perfect  director  is  given  in  Table  4. 

Table  4 

Sight  Performance  Comparison 


RMS  Parameter 


Case 

Pipper 

Error, 

Lead 

e (deg)  Angle, X (deg) 

Pitch 

Rate,  q (deg/sec) 

Elevator 
Def  1 . (de: 

1000ft. , 3. 5g 
LCOSS 

2.3 

2.9 

8.7 

2.1 

Director 

1.6 

2.6 

7.9 

2.0 

1000ft. ,Sg 
LCOSS 

3.0 

4.2 

12.1 

3.0 

Director 

1.9 

3.7 

11.0 

2.9 

3000ft. ,3.5g 
LCOS 

2.3 

10.1 

7.3 

1.8 

Director 

1.1 

9.7 

6.0 

1.6 

3000ft. ,5g 
LCOS 

2.7 

14.2 

10.0 

2.5 

Director 

1.4 

13.8 

8.5 

2.3 
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The  pilot-optimal  augmentation  analysis  was  performed  for  the 
director  cases  as  well,  and  the  optimal  gains  are  given  in  Table  5, 
where,  as  in  the  LCOS  case, 

Mfu  ’ - Kx?~  fa 

Kf  - 

Further,  the  augmented  plant  matrices  become 

Ap  ■ A-  B fa 
Br*  B(t-fa) 

and  the  plant  eigenvalues  are  shown  in  Figures  6 and  7. 

Comparing  these  figures  with  Figures  1 and  2 indicates  large 
differences  in  the  effect  of  augmentation  on  the  vehicle  short 
period  mode.  This  important  difference  highlights  the  effect  of 
total  system  dynamics  (pilot .^sight,  etc.)  on  the  desired  vehicle 
dynamics.  In  the  case  for  F = .01,  for  example,  with  the  LCOS  a 
significant  change  in  short-period  damping  with  little  change  in 
natural  frequency.  However,  with  the  director,  the  natural  fre- 
quence is  measurably  increased.  Finally,  changing  the  LCOS  sight 


time  constant  ( Yf  ) significantly  altered  the  eigenvalue  locus. 

As  can  be  seen  in  Figure  8,  finally,  the  tracking  performance 
is  significantly  improved,  although  not  as  dramatically  as  in 
LCOS  (Figure  3),  and  the  elevator  trend  is  as  before. 


Optimal  Gains  - Perf.  Director 


Ope* t " 


-^1 

5 ■ T ? „ 

^1,  wA 

'C  u?’vJ5 
Q ^ N (v^ 

» 

*< 


GU/a-for 


sue* 


IV.  F-106  Tracking  Analysis 


As  a final  demonstration  of  the  optimal  pilot  model  and  augmentation 
methodology,  an  analysis  of  a maneuvering  flight  case  (rather  than  level 
flight)  was  initiated.*  The  test  case  chosen  was  an  F-106  aircraft  track- 
ing in  a level,  4-g  turning  condition  using  a perfect  director  sight. 

This  high-g  high  bank  angle  condition  represents  a case  in  which  the 
lateral-directional  motions  and  longitudinal  motions  are  not  uncoupled, 
hence  a large-order,  multi-input  multi-output  system  is  considered. 

This  particular  case  was  not  chosen  randomly  for  analysis.  Recently, 
a piloted  simulation  study  of  this  vehicle  with  various  sights  was  com- 
pleted in  AFFDL's  LAMARS  facility.  Hence,  simulation  data  is  now  available 
for  comparison. 

A significant  portion  of  this  total  effort  was  involved  in  the  develop- 
ment of  the  perturbation  math  model  of  this  engagement.  Presented  in  the 
Appendices  are  the  results  of  this  modeling  effort,  and  the  necessary 
remaining  models  will  be  presented  here.  The  pilot  model  parameters  selected 
will  be  summarized,  and  the  performance  comparisons  will  be  presented. 


IV.l  Model  Development 


The  general  equations  for  the  line-of-sight,  lead  angle,  target  kine- 
matics, and  F-106  aircraft  dynamics  are  developed  in  Appendices  A » B , 
C,  and  D respectively.  The  flight  condition  of  a 4-g,  level  turn 
defines  the  equilibrium  (steady-state)  conditions  for  the  vehicle.  In 
this  case  we  have 


U-|  = 775  ft/sec 

V-|  = 0.  ft/sec 

W.|  = 0.  ft/sec 

$1  = 75.5  deg 

Az  = -4g  = -128.8  ft/sec2 

5e  = -20.2° 


± = .161  rad/sec 
P.|  = 0.  rad/sec 
Ql  = .156  rad/sec 
R^  = .040  rad/sec 
B-j  = 0.  deg 
^ = 14.5° 


Now,  unlike  the  level  tracking  case  presented  in  the  previous  section, 
the  steady-state  line  of  sight,  lead  angle,  target  acceleration,  etc.  are 
not  zero,  and  must  be  determined.  From  Appendix  A , the  steady-state 
azimuth  LOS  must  satisfy  the  relation 


* Unfortunately,  time  did  not  permit  the  augmentation  analysis  to  be 
completed  here  in  the  summer's  efforts.  This  is  a planned  future 
research  activity. 
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1 i 


- sin  a-j  (Wj 


W,)] 


51BA2  = 0 = " V ' BAztcos  “l^  ' V , (] 

- Dj[R.|(cOS  a1  - Be1  sin  a^)  + P^sin  a-|  + BE1  cos  a-])]  (P J 


Now,  from  geometry,  we  can  write  the  expressions  for  the  target  velocity 
in  the  attacker's  stability  coordinate  system  as 


Jy  = Vy  COS  Al£_ 


i/y^  = Vy  sin  Ajj^  cos  4>a 


where 


Wy  =-Vy  sin  a^  sin  $A 
Vy  = target's  velocity 


A£  = target-attacker  heading  difference  (i/»y  - \ |»A) 
$A  = attacker's  bank  angle 


Substituting  into  equation  A and  noting  that  in  steady  state  V.  = VT  and 


0 yields 


R,°l 


(cos  a-j  + sin  a-|  B£L)  = -BAz  COS  COS  All;  - 1.) 


+ sin  A£(cos  <t*i  - BAz  sin  sin 


(B) 


Now,  assume  a nominal  tracking  range  D-,  of  2000  ft,  sin  a,  BFI  « cos  a., 
and  BAz  = 0 yields  1 1 1 


sin  Aip-j  = .4 

A^-j  3 23.6° 


Therefore,  the  targets  velocity  and  accelerations  are  (see  Appendix  C ) 

2 


U, 


'1 


'1 


710  ft/sec 
78  ft/sec 


= -50.  ft/sec 


'1  . 


0.  ft/sec 


WT  =-320  ft/sec 


yl  2 

At  1 =-118.  ft/sec 


(c) 


'1 


1 


Where  the  target  is  assumed  to  be  in  a level,  4-g  turn  also. 


Now,  the  bullet  time  of  flight,  lead  angles,  and  line  of  sight  may  be 
estimated.  From  Ref.  5 , the  projectile  flight  distance,  time  of  flight, 
average  velocity  and  attacker  velocity  are  related  by 


yvA  * vf)  = d, 


and 

where 


)f  = D + (VA  + D)  Tf 


D is  the  present  range. 


" 


l 


J 
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Therefore,  the  average  flight  velocity  is 
Vf  : D/Tf 

Finally,  using  Ref.  5 , if  V 2-  = 3300  ft/sec,  the  time  of  flight 

is  estimated  as  T*  = .786  sec,  ..  . . , ,oc  r 

f Vf  5 2545  ft/sec,  and  Jv  = .185.  From 

Appendix  B » the  steady  lead-angle  equations  for  the  director  are  then 
Lel(2545)  = Bel(5.2)  + 323. 

If,  for  zero  pipper  error,  BEL  = LEL>  we  have  LEL  = B£L  = .127  rad. 
Likewise  for  azimuth, 

LAz(-2545)  = -BAz(5.2)  + 78 

If  Ba  : La  we  have  LA  = -.031  rad.  Re-iterating,  assurmring  zero  steady 
state  pipperlerror,  we  nave 

LEL  = BEL  = *127  rad 

Lflz  = BA2  ' -031  rad 


With  these  steady  values,  and  those  given  in  Equation  C,  we  now  have 
specified  the  coefficients  of  the  perturbation  equations  for  range  and 
line-of-sight  rate  (Appendix  A },  lead  angles  (Appendix  B ),  attacker 
kinematics  (Appendix  C ),  and  vehicle  dynamics  (Appendix  D ). 

Now,  choose  the  seventeen-element  state  vector  as 


x = [n 1 9 ay,  n£»  <J*y»  Aip , Vj,  Uy,  d,  8E-|»  $Az*  *7’  P 

and  the  equations  just  cited  can  be  used  to  form  the  linear  model 
7=Ax  + BUp  + Ew 


with  U'  = [6f 

p stick 
and  target  noises 


stick 


w'  = [?i , 0,  ^2>  0*  •••] 

(Note  that  the  rudder  pedal  has  not  been  included  as  a control. 
Discussions  with  fighter  pilots  indicate  little  rudder  usage  in  this 
tracking  task.  This  addition  could  be  made  later  however.  Also,  note 
that  the  attacker  and  target  perturbations  U and  Uj  are  omitted  to  limit 
the  order  of  the  state  vector.) 


With  this  model,  and  steady-state  conditions  as  defined,  the  system 
eigenvalues  are  as  follows: 





rx 


29 


Short  period. 

- 0.91  + 3. 33j 

Dutch  roll , 

- 0.43  + 4. 87j 

Roll, 

- 1.0 

Spiral, 

- .002 

Target  acel . , 

- 1.0  + O.j 

Target  roll , 

-1.0  + O.j 

Lead  angles, 
line  of  sight 
and  kinematics 

f -.005  + . 16j 

j -.006+.16J 
(.  3 remaining  = 0. 

IV. 2 

Performance  Comparison 

Two 

perfect 

different  simulation  runs  were  performed  in  a level  turn  with  a 
director  sight.  Since  the  steady-state,  or  mean  values  of  the 

performance  parameters  differed  between  the  two  runs,  as  shown  in  Table 
6 , it  was  decided  to  disregard  run  number  109  results.  The  justification 
for  this  is  that  the  mean  values  from  this  run  did  not  agree  with  the 
stated  flight  condition  (e.g. , for  a level,  4-g  turn,  trim  bank-angle 
$1  .is  75.5°). 

The  parameters  selected  for  this  pilot  model  are  shown  in  Table  7 . 

It  should  be  noted  that  these  parameters  are  nominal  values  used  frequently 
in  other  pilot  modeling  investigations  (see  Hess,  Ref.  B ),  except  that  the 
output  weighting  matrix,  Qy,  is  the  same  as  that  used  previously  in  the 
longitudinal  tracking  analysis. 

Table  S 

Steady-State  Parameter  Comparison 

I 


Lead  Angles 

*EL  XAz  (r*d-) 

Target  Velocities  (gun  coord.) 
»T(fps)  uT(fps) 

Run  109 

Run  133 
Theoretical. 

.025  -.028 

.123  -.048 

.127  -.031 

27.  -65. 

100.  -312. 

78.  -302. 

Euler  Angles  (body) 

♦r  e,  (deg) 

Body  Rates 

Pj  Q1  R,  (deg/sec) 

Run  109 

Run  133 

Theoretical 

28.9  3.4 

72.6  3.6 

75.5  3.6 

38.0  2.9  2.6 

- .6  8.3  2.5 

- .6  8.9  2.2 

Velocity  Angles 

a1  (deg) 

Stick  Deflections 
«c  6,  (in) 

»t  *st 

Run  109 

Run  133 

Theoretical 

3.8  0.6 

10.9  0.6 

14.5  0. 

-1.  -1. 

-?.3  - .4 

-1.0  -0. 
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The  observation  (output)  vector  includes  pipper  error  and  lead  angles,  as 
before,  plus  the  target  bank  angle.  This  angle  is  considered  an  important 
pilot  cue,  but  because  of  the  range,  its  observation  threshold  is  increased. 


Table  7 

Pilot  Model  Parameters 


Observation  delay,  t = .2  see 
Neuromuscular  lag,  tm  * = .2  see 

Motor  noise,  Vu  = C2U  , fz  = fa  = .01  (-20  dB) 

Observation  noise,  V = > Pi  = .01(-20dB) 

yi  n”  yi  1 

Fractional  attention,  0.5  to  longitudinal  and  lat-dir  axis,  fi  = .5, 

all  i 


__  • • • • • 
Output  Vector,  y - eAz*  ^El’  ^El’  ^Az*  ^Az*  ^T*  ^T^ 

Observation  thresholds,  .05  deg,  .1  deg/sec 

except  for  <f>j  - 5 deg,  10  deg/sec 

Output  weights,  Qy;  Qe  = 16. , Qe  = 1 . , QX  = 4. 


The  angle  off  (=A^)  was  not  included  because  in  this  in-plane  encounter, 
the  pilot  was  not  considered  to  perceive  this  variable  accurately. 

Before  comparison  of  the  rms  performances,  it  is  important  to  point 
out  one  fundamental  difference  between  the  simulation  and  the  model  pre- 
diction method.  The  model  is  structured  by  assuming  the  pilot  is  performing 
a regulator  task,  while  the  target  is  randomly  maneuvering.  On  the  other 
hand,  this  was  not  the  situation  simulated,  the  task  simulated  was  a 
capture  and  track,  that  is,  it  was  more  a variable  initial  condition  task. 
Furthermore,  the  simulated  target  exhibited  essentially  no  rms  accelerations 
and  bank  angle  about  the  mean.  Consequently,  the  model  target  accelerations 
(in  this  case,  bank  angle)  must  simply  be  guessed,  then  parametric  comparisons 
made  with  these  facts  in  mind.  One  must  compare  relative  to  Vy  and  wj  which 
can  vary  greatly  with  the  time  constants  selected  in  the  target  acceleration 
and  bank  angle  noise  model.  And  these  time  constants  must  be  selected 
purely  on  judgment.  Clearly,  after  comparison,  target  time  constants 
could  be  adjusted,  but  the  intent  here  was  to  be  "predictive"  as  possible. 

The  performance  is  given  in  Table  8 . It's  felt  that  the  agreement  is  more 
than  satisfactory. 


Simulated 

r = 5° 

rJ  - 7° 

*T 


r = 50 

9t 

sr  • 7° 


imulated 

♦t  *5' 

r » 7° 


imulated 
= 5° 
7° 


a 


Table  8 

RMS  Performance  Comparison 


Target  Motion  _ 
^(fpsi  °*y(deg) 


Tracking  Errors 


£rn  (rad) 


Euler  Angles  (body) 

e (deg) 


Velocity  Angles 
a $ (deg) 


* </A  a»j  r!»t  Co*s-f*n+f- 


Target  Velocities* 
yCTj  ytiTj  (ft/sec) 


• 

0.3 

34. 

• 

5.0 

CO 

CVJ 

• 

7.0 

38.  1 

Lead  Angles 


XE1  (rad) 


Body  Rates 

q r (deg/sec) 


Stick  Deflections 
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*T  - VA  ’ 

But  the  time  rate  of  change  of  a unit  vector  is  the  cross-product  of  that 
vector  with  its  rate  of  rotation  vector  in  inertial  space,  — , or 
• • 

? = 8 X S 

So  T is  the  rate  of  rotation  of  the  LOS  in  the  inertial  (e.g.  earth- 
fixed)  coordinate  system. 

Taking  the  cross-product  of  Eqn.  A with  s yields 

s X ( VT  - VA)  = sfi  Ds  + s X (8  X 7 ) 0 

And  from  the  vector  triple  product  identity 

S X (6  X ?)  = B - (S  1 s)  s 
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Now,  instead  of  the  inertial  LOS  rate  g,  we  desire  the  LOS  rate  in 
the  attacker's  gun  coordinate  system  °r  jgun>  which  is 

8 = g - a) 

gun  p -g 

where  “ in  the  rotation  rate  vector  of  the  gun  coordinate  system.  Sub- 
stituting into  the  above  yields 

“gun  + “g  ' ^gun  + »g>  ‘ ^ 1 * * < V V 

or 

“gun  = X <VT  ' V - “g  + <V  ' ?>?t  <»g  ' 

Now  the  term  (?cL,_  • s’)  * 0 since  the  rotation  vector  and  the  line  of 
sight  in  the  gun  coordinate  are  orthogonal. 

The  necessary  vector  equation  is  then 

6gun  = D^s  X^VT  “ VA^  " “g  + ^“g  ’ s^ 

From  the  figure  on  page  1,  we  can  define  the  vectors  as  follows,  using 
small  angle  approximations 

J * Tg  + 6AzTg  +_eE1*g 

sg  " -E&  * eAzkg 
a)  = P T + Q j + R K 

g g g g g g g 

vA  = uATg  + vAJg  + wAkg 

VT  = uTgTg  + vTIg  + uTVg 

(It's  important  to  note  that  Yj  and  the  components  here  defined  are  the 
velocity  components  of  the  target  in  the  attacker's  gun  coordinate  system.) 
Carrying  out  the  operations  in  Equation  r.,  and  assumming  u>q  • s : P , we 
have  the  desired  relations  9 9 

6^1  = “ Wrt)  - 8ct(Ut  - Un)]  + Qn 


El'  Tr 


“Az  ■ !>«%  - V - eAz(uTg  ■ ug)]  - Rg  + Vei 

To  obtain  the  equation  for  the  rate  of  change^  of  range,  or  D,  we  can 
simply  take  the  dot  product  of  Equation  A.  with  s to  obtain 

• — + * — 

(VT  - VA)  • s = Dts/**  T)  + 0(6  x^T  • s 

so  . _ _ 

D = (VT  - VA)  • s 

o * (%  - Kg)  + <*T,  - V6Az  * * “g)8El 
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A .2  Seal  or  Equations  in  Stability  Axes 

Since  all  our  equations  are  ultimately  expressed  in  the  vehicle  stability 
axes,  we  need  to  express  the  above  equations  in  terms  of  stability-axes 
variables  rather  than  gun-coordinate  variables. 

Assuming  the  gun  axis  to  be  aligned  with  the  vehicle  body  reference 
axis,  we  have  the  following  relations  between  gun  (-)g  and  stability  (*)s 
variables 


U$  cos  a-|  - W$  sin  ct-j 


V = V 
9 s 

W g = Ws  COS  a-j  + U$  Sin  a-j 


R$  sin  c*i 


Pg  = Ps  COS  a, 

Q = Q 

ys 

Rg  = Rs  cos  a-|  + Ps  Sin 

The  resulting  equations  are 

D = [(UT  - Us)  cos  ct1  - (WT  - W$)  sin  ] 

* 6az<VTs  - V 

+ 3E1I(WT  - W$)  cos  a1  + (UT  - Us)  sin 

6E1  = 1^WT  " V cos  al  + ^UT$  -.Us)  sin  al 

- eE1{(UT  - Us)  cos  a1  - (W-^  - W$)  sin  o^}] 

+ Qs  - 6Az(Ps  cos  ai  " Rs  si 

SAz  * - Vs>  - 6AZ«UTS  - Us>  c°s  «1  - <“ts  - “s>  sin  »1>1 

- (R$  COS  a-|  + P$  sina^)  + 6el(P$  cos  a-|  - R$  sina^ 

where  Uts,  VT$,  WTs  are  now  the  components  of  the  target  velocity  in  the 
attacker's  stability  axis  and  a-|  is  the  attacker's  trim  angle  of  attack 
(or  the  trim  angle  between  the  gun  axis  and  stability  axis). 


A .3  The  Perturbation  Equation 

Now  to  linearize  the  above  qquations,  we  will  define  the  motion  variables 


in  terms  of  a constant,  steady-state  quantity  (•)}  and  a time-varying  per- 
turbation about  the  steady-state  (-)l  variable.  5o,  let 


D = D1  + d 

8el  * Bai  + 


8Az  " BAzl  + eAz 


U$  = U1  + u 


Vs  = V!  + v 
Ws  = W1  + w 


Pc  = pi  + P 


Qs  = Q]  + q 
Rs  = Ri  + r 


Inserting  these  into  Equations  E will  yield  two  sets  of  equations, 
one  for  the  steady  state. values  and  one  for  the  perturbation  quantities. 

For  example,  taking  the  6^  equation  we  have 

(D]  + d) (Be1  ^ + eE1)  = [(W.^  - W^  cos  ^ + (UTi  - 1^)  si  n^] 

+ [(Wy  - w)  COS  a-j  + (u-j-  - u)  sin  a-|  ] 

t ' -(B^i  ^ -j ) t ( U-p  - U-j ) COS  a-j  - (W-j-^  _ ^] ) sin  a] 

+ ( u j - u)  cos  a-j  - (\Hj  - w)  sin  o^] 

+ (D1  + d)(Q1  + q)  F 

- <°1  * d»BAz,  + 6^Z)[(P1  + p)  cos  - (R]  + r)  sin  0]] 

Now  the  steady  state  ( )1  values  certainly  must  obey  the  original  equation, 
s0  . o iy 

B1?E1  = t(wj  " Wl^  C0s  “l  + ^UT-|  " S1n  al^ 

- Be1  [ ( U-j-  - l^)  cos  o1  - (WT^  - W^  sin  c^] 

+ D-jQ1  - D-|  BAz  [P1  cos  a1  - R-|  sin  G. 

We  may,  as  a result  subtract  this  relation  out  of  the  complete,  original 
relation  (Eqn.  F).  In  addition,  under  the  assumption  that  the  perturbation 
variables  are  small,  we  will  drop  higher  order  terms  (e.g. , products  of 
pert,  quantities).  We  are  left  with 

D18E1  + d ?E1  = Kwt  " w)  cos  ai  + (ut  ■ u)  S1*n  ai J 

- Bej  [(ut  - u)  cos  o1  - (wT  - w)  sin  ] 

- Be1[(Uj  - ) cos  - (WT  - W-j)  sin  c^J  H. 

+ D1  q + Qi d - d BAz  (P]  cos  - R-|  sin  a-|) 

- eAz  D-j (P-|  cos  - R1  sin  a])  - D]BAz  Cp'  cos  a]  - r sin  a 
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This  equation  is  now  linear  in  terms  of  the  perturbation  variables, 
and  the  steady-state  variables  are  constant  by  definition. 

In  like  manner,  the  remaining  two  equations  in  E result  in 

- [( U-J-^  - U-j ) cos  <*1  - ( Wy ^ - W-j ) sin  a-j  J + BAz<VT,  ' V 

+ BE1  [(WT  - W-j ) cos  ot-j  + ( Uy ^ - U-j ) sin 

d = [(Uy  - u)  cos  - (wT  - w)  sin  o^]  + BAz(Vy^  - Vj) 

+ BAz  (Vy  - v)  + e"E1[(Wy^  - H,)  cos  a.,  + ( Uy ^ - U|)  sin  a.,] 
+ BE1  [(wy  -w)  cos  a-j  + (uy  - u)  sin  a-j] 

& * B7[(vt1  - vi>  - bAz(<ut1  - V cos  *1  - (“t,  - "i>  sin  V1 

(R^  cos  + P-j  sin  a^)  + Be-|  (P-|  cos  - R.|  sin  a^) 


D1®Az  = ^VT  " ‘ BAz1^uT  - C0S  al  “ ^WT  * sin  al^ 

- eAz[(Uy  - U-j)  cos  ai  " (Wy  ’ V sin  al^ 

- D-|  (r  cos  a1  + p sin  a-j)  + d ( R-j  cos  + Pi  sin  a^) 

+ D-J BE1  (p  COS  a-]  - r sin-  a-j ) + (P|  cos  ai  “ Rj  S1'n  ai ) 

+ d BE1  i (P-j  cos  a.j  - R^  sin  a-j) 


B.  Lead  Angle  Equations 


8 . 1 Perfect  Director 

Again  linearized  equations  are  needed  for  the  lead  angles  computed  by 
by  the  sights^  From  Ref.  5,  page  6-9,  the  fundamental  equation  for  the 
lead  angle.  A,  in  general,  is 

V = D[t  - (i  ■ 1)71  * VaV  + x 

where,  as  defined  in  the  above  reference 

* s ^ ’gun  ~ *ElJgun  " Vz^gun 

= average  projectile  velocity  in  still  air 

[Vf  = Df/Tf  - VA,  Df  - distance  of  flight] 

Tf  = projectile  time  of  flight 

VA  = attacker's  velocity 

Jv  = Jump  correction  factor,  = VM  " Vf , VM  is  projectile 

muzzle  velocity  VA  + VM  ” 

“gun  = Tgun  x^,  * ‘“Jgun  * ekgun>  “ and  6 here  are  a"9,e 

of  attack  and  sideslip 

• • 

Vj  = attacker's  acceleration 

Now,  if  the  actual  line-of-sight  rates  are  assumed  known  as  in  a 
director  sight,  from  the  LOS  equations  given  previously 

D[t  - (t  • ?)?]  = ? X (VT  - VA) 

• 

Furthermore,  if  the  targets  acceleration,  VT,  is  assumed  known,  we 
have  the  fundamental  equation  for  a perfect  director  in  terms  of  the 
target's  velocity  and  acceleration. 

To  obtain  the  scalor  component  equations,  define  the  following  vectors 
in  the  gun  coordinate  system 
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A = AEljgun  ' AAzkgun 
a_im  = 

gun  gun  gun 

= ATx  Tgun  + ATy^gun  + AT2kgun 

VT  = Ux  T + VT  J + WT  k 

‘ Tg  9 n Tg  9un  Tg  9un 

where  as  in  the  LOS  equation  development,  it's  important  to  remember  that 
the  target's  velocity  and  acceleration  components  are  in  the  attacker's 
gun  coordinate  system.  Carrying  out  the  vector  operation  yields 

VfXEl  ■ t6El(UT  - V'(“t  -V-  W 

9 9 * <8E,AT.  - *T  ) 4 


"Vaz  * [<VT  - Vg>  - »Az<UT„  - V1  + JVVAB 

g g t 

+ (A-,  - S*  Ay  ) _f 

Yg  Yg  2 

Now,  these  equations  will  be  expressed  in  terms  of  stability  axis 
velocities,  etc.  rather  than  gun  axis.  We  have  the  relations 

Ug  = Us  cos  a-|  - W$  sin  o1 

V = V 
g s 

Wg  = W$  cos  o1  + Us  sin  a1 

Aj  * Aj  cos  a.j  - Ay  sin  a-j 

xg  xs  zs 

AT  = AT 
vg  Ys 

Ay  = Ay  COS  O-j  + Ay  Sl^  a-| 

z z X 

g s s 

Finally,  using  the  perturbation  technique  we  introduce  the  relation 


AE1  = LE1  + SE1 

a 

a = ot-j  + o 


Ax  = At  + at 
'X.  *X, 

5 etc. 


• Introducing  these  perturbations  and  assuming  that  Vf,  T,,  and  J,  are 
constant  for  this  analysis  we  have  the  following  steady  state  relations 
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Vf*-£y=  Bei  ^(UT-j  " Ul^  cos  “l  " ^WT  ■ wi^  sin  “i^ 
-t(Wj  - W-j ) cos  ot-j  + (Uj  - U-j ) sin  ay] 

1 t 1 ’ 

-JUV.°<,  - __f[(AT  cos  a,  + At  sin  a, 

v MT  1 2 rz-|  1 TX-|  1 


BE1(AT  cos  “1  - *TZl  si"  «ll 


‘VfV  (VT,  ' V - BAz((UT,  * <V  cos  “1  - (“T,  - “l>  si"  “lJ 
+ Vab1  * - BAz<ATXl  cos  “1  - atz,  s1n  «)] 

Furthermore  the  perturbation  equations  are 

VfXEl  “ BE1^uT  " cos  a1  " (wy  “ w)  sin  a-|l 

+ 6E1^UT1  " cos  al  " (wj  ‘ Wl^  sin  ai^ 

- [(Wj  - w)  cos  ay  + (uT  - u)  sin  aj 

- Tf  (aT  cos  a,  + aT  sin  a,) 

~Z  'z  l *x  I 

- cos  ay  — ay  sin  ay) 

~ (Ay  cos  a-j  — Ay  sin  ay ) 

X1  Z1  / 

- W (< 


-VfXAz  = (Vy  - V)  - BAz[(Uy-  U)  COS  Uy  - ( Wy  - W)  Sl^  Oy] 

“6Az [(  Uy ^ ~ Uy)  COS  Oy  - (Wy^  - Wy ) Sl^  Oy] 

BAz<aT“  > 

' BAz(AT  cos  “1  - AT  s1n  V’ 

Xy  Zy 

t W'l 

B .2  LCOS  Equation 

Returning  to  equation  A in  the  previous  section,  recall  that  the  perfect 
director  assumes  perfect  knowledge  of  the  line-of-sight  rate  and  target 
acceleration.  The  LCOS  system,  however,  uses  attacker  rotation  rates  and 
acceleration  to  approximate  the  above  parameters. 

In  terms  of  the  line  of  sight  rate  in  the  attacker's  gun  coordinate 
system,  - we  have  from  Ref. 5 , page  6-35 
sAi 
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n 


B * “g  * * 


In  this  case. 


6 - (e  • s)s  = (u)g  - X)  - [(cog  - X).  si  S 


= (“g  - A)  - (Wg  • I)!  + I) 


In  addition,  the  attacker  acceleration  A.  is  substituted  in  place  of 
the  target's,  resulting  in  the  vector  equation  for  the  LCOS 


V 1 D[<»g  - x)  - <"g  ' s>sl  * VaV  * <s  x V -S 


with 


s = ■'g  + XAzjg  + XELkg 


0 


PJ-  + V'g  + Rgkg 


g g 


"'A  - Axig  + Vg  + Azkg 


and  A^j 

we  have  the  following  two  scalor  differential  equations 


DA 


EL  " 'VfAEL  + ^g  ' D(Pg  + 1Az°g  * JEL  V A3 


-JvV  * 4(ae1ax  - a2) 


(0 


and 


_DXAz  = Vftz  + D Rg  - D<Pg  * xAz  % + XEL  ■ Rg^  XEL 


* W * - xAzAx> 


The  steady  state  equations  are  of  course 

¥eL  ■ 0 ■ -¥az  + ¥l  - JvVl 


Tf  [LC1  (A  cos  a,  - A,  sin  a,)  - (A,  cos  a,  + Av  sin  a,)] 

2 lL  A*j  I I Zj  I | 

- Di LAz ^P1  cos  al  ' R1  Sin  al^  + LAzQ1  + LEL  (R1  cos  al  + 


P]  sin  c^)] 


■D-jL^2  = 0 = V^L Az  + (P]  cos  a]  + P]  sin  a-j ) 


* J,Vl  + ]f  [Ayi  - Lflz(AXi  cos  a,  - AZ)  sin  «,)] 
-DlLEL[(Pi  cos  ai-  Sin  a.,)  + LAzQ1  + L£L(R1  cos  aj  + 


P1  sin  «,)] 
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C.  Target  Kinematics 


In  the  foregoing  sections, the  equations  for  the  1 ine-of-sight  rates 
and  lead  angles  were  derived  in  terms  of  the  targets  velocities  and 
accelerations  in  the  attacker's  stability  coordinate  system.  In  this 
section  the  relations  governing  these  velocities  and  accelerations  will 
be  developed. 

Now,  in  a moving  attacker  coordinate  system  with 

^Target  = Va  + Va  + WT^A 
we  have  in  the  stability  axis  of  the  attacker 

vT  = uT  T$  + vT  T$  + WT  IT  +u  X vT 
• s s s 

Also,  since  Vy  is  the  target's  acceleration  vector,  we  can  write  it  in 
terms  of  accelerations  in  the  stability  axis  as 

Vj  = Ay  i + Ay  j + Ay  k 

xs  *s  zs 

Therefore,  we  have  the  seal  or  equations  for  the  targets  velocities  in 
terms  of  accelerations,  or 


UT  * at  + (RsVt  - QcWt  ) 

s x$  's  s 's 

*r,  = \ + (ps"ts  - Rs\> 

“t.  ■ V.  + (»sut  - PsVT  ) 


where  P$,  Q , R$  are  the  rotation  rates  of  the  attackers  stability  axis. 

We  may  now  express  the  target  accelerations  in  terms  of  the  geometry 
of  the  engagement.  In  the  case  of  a level  turn,  the  geometry  is  shown  in 
the  following  sketch  ✓ //  v 


'«*A 

tr,  l/r 


XA  and  Xy  both  in  level  plane 
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From  goemetry  we  can  then  write  the  expressions  for  the  target's 
acceleration  in  the  attacker's  coordinates 

Aj  = At  cos  Aip  + AT  sin  Aip  sin  <pT  sin  fcos^y 
XA  XT  ZT  yT 

Ay  = Ay  sin  Aip  cos  <pA  + Ay  cos  Aip  cos  Uy  - <j>A) 
yA  XT  yT 

- Ay  cos  Aip  sin  (<f>y  - <pA) 


Ay  = -Ay  sin  Aip  sin  <pA  + Ay  cos  Aip  sin  UT  - <j>A) 

ZA  XT  yT 

♦-Ay  COS  Ay  COS(*y  ) 

ZJ 

Now,  if  it  is  assumed  that  in  this  encounter  the  target  is  capable 
only  of  significant  accelerations  in  the  ZT  direction  (AT  = AT  = 0) 
we  have  ~ 


yT 


Ay  = Ay  sin  Aip  sin  <f>T 
XA  x7 

Ay  = -Ay  COS  Alp  Sin  (<py  - <pp) 

y a zt 


y = Ay  COS  Alp  COS  ( 4>y  “ (f^) 

ZA  ZT 


Substituting  these  into  Eqn.  A,  we  have 

UT  *-At  s itr  A\p  sin  $T  + (R  VT  - Q WT  ) 
s 'zy  s ‘s  s 

Vy  =-Ay  cos  Aip  sin  Uy  - $A)  + (P$Wy  - R$Uy  ) 
Wy  = Ay  COS  Aip  COS  Uy  - <t>A)  + (QsUy  - P $ Vy  ) 


Since  the  turn  rate  is  described  by  the  relation 
• = -Az  sin  » 


we  can  state  that  Aip  = ipy  - ipA  is 

-Ax  sin 


Aip  = 'T,  T 

ZT  + 


A.  sin  <f>. 


i amr  nr  "i 
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or,  approximating  with  we  have 


4*  =.  +T 


ZoaA  sin  *, 


The  perturbation  equation  for  Eqns  B and  D may  be  derived  using  the 
relations  cl 

sin  (A  + a)  3 sin  A + a cos  A 
cos  (A  + a)  * cos  A - a sin  A 

Introducing  the  familiar  steady  state  and  perturbation  variables, 
we  find  that  the  perturbation  equations  are 


• _ 

- aT  sin  A^  sin  $T  + Aip  Ay  cos 

+ <(>T  Ay  sin  A^i  cos  + rVT 
Z1  1 S1 
+ R-|yT  - <?wy  - Q-|yj 


AjL] 


sin  4>, 


Vj  = -Ay  COS  Aj^  Uy  - *A)  + pWT  + PjWy 

Z1  S1 
- r Uy  - R-jUy 

S1 

Uy  = ay  COS  A^  - Ay  SiO  A^  Alp  + <7  Uy 

z z1  S1 

+ ^1*^1  ” ” ^1UT 


(E) 


And,  assuming  AT  3 A.  , 4>T  = 4>A  , VT  = V.  we  have 
Tz,  \ T1  A1 

4Vrt  * 4A[AZl  cos  *1UT  • *A>  * s1"  ‘l  (aTz  - Zaa)1 

Finally,  to  model  the  target's  acceleration,  ay  , and  bank  angle,  <Py » 
one  approach  is  to  assume  a Markov  process  in  the  form 


(*) 
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D.  F106  Perturbation  Equations 

In  this  section,  the  linearized  equations  for  the  vehicle  dynamics 
are  suirmarized.  From  Reference  8 , the  perturbation  equations  for  a 
general  steady-state  flight  condition  are  as  follows: 

u - V-jr  - R-|U  + = -gecose^  + X^u  + Xjj  + Xfi  6^ 

v + U.|r  + R-|W  - W^p  - P^u  = -gesin$.|  sine^ 

+ g<(>cos$1  cose1  + Yyv  + Y^p 

+ V + \ A + VR 
w - + V^p  + P.|V  = -gecos<i>i  sine^ 

- g<()Sin$^  cose1  + l_u 
+ Z w + Z„  6, 


w 


VE 


1 


« + T ^xx-  Izz)f,l*'+  M + Ixz/.  <2V“2R1 

. xyy 

= H u + M w + M'w  + H q + H.  6C 
u w w ^ Sr  E 

p ' Ixz/lxxr  ' V,  <Pl<!  + Q,P) 

Xxx 

* T^’zz  - V,R1«  + V>  = 4s  + y 


r) 


+ L r + L.  6.  + L.  6„ 

r 5A  A 6R  R 


ixz/  p + i«(Qi*  + Ri<?)  + T (Jyy  - + Qlp) 

xzz  xzz  zz 


■ v + y + v + Va  + \sr 


where  in  these  relations  the  (•),  quantities  (e.g. , U, , V,,  W, ) are  constant, 
steady-state  values,  and  X , Z , etc  are  the  dimensional  Stability  derivatives 
(see  the  above  reference). 

The  case  to  be  evaluated  consists  of  a level,  4-g  turning  condition  at 
10,000  ft  altitude,  Mach  = 0.72.  Therefore,  for  this  case,  the  steady-state 
parameters  are  given  as  follows  (equations  developed  in  the  aircraft  stab- 
ility axis) 


* Si. 


U-j  = V = 775  ft/sec 
V-,  = W-,  = 0. 


0-j  = 0. 

cos  *1  ■ W ■ T 


4>1  = 75.52  deg. 


Turn  rate,  = g(tan  $1^/U-|  = .161  rad/sec. 
Roll  rate,  = 0 

Pitch  rate,  Q-|  = ^ sin  = .156  rad/sec 
Yaw  rate,  R-j  = j£  cos  = .040  rad/sec. 


The  angle  of  attack  required  (Ref. 9 ) is  : 14.5°  and  the  elevator 

deflection  for  trim  is  -20  deg  at  this  altitude  and  velocity.  The 
inertias  (in  the  body  axis)  are 

I = 2.126  x 104  slug-ft2 

AA 

I = 2.035  x 105  slug-ft2 

I « 2.175  x 105  slug-ft2 

I = 7.316  x 103  slug-ft2 

XZ  , 

2 

The  reference  wing  area  is  695  ft^and  the  weight  is  33,000  pounds. 

In  the  stability  axis,  the  inertias  are 
I = 2.999  x 104  slug-ft2 

AA$ 

I = 2.088  x 105  slug-ft2 

I = -4.111  x 104  slug-ft2 
xzs 

Finally,  the  demensional  derivatives  are  estimated  to  be  (for  c =m.a.c. 
= 23.8  ft) 


-.0924 

-176.0 


Ye=  -177. 


X*  = -24.4 

Zu  = -437 
Za  = "98?- 

Z5  = -288. 

MuE=  “.00223 
Ma  = -11.5 
M?  = - . 785 ( . 3485) 
Mq  = - . 785 ( . 3485) 
M.  « -17.36 


S - -47-6 

Lp  = -.0613 


> = -57-4 
\ . .567 


Np  - -.176 

N - -.842 
r 


N . = -5.93 
5A 

N x = -3.98 
6R 


Finally,  In  the  steady-state  condition  being  considered,  the  aileron/ 
rudder  interconnect  results  in  the  following  relation 

5r  " SR  commanded  "^‘^A 

And,  in  addition,  the  feel  system  for  this  vehicle  is  estimated  to 
provide  the  following  stick  gains 
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(in.) 

A 

Astick 

6D 

s .0738 
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stick 

6 = -.2286  + .0104$  + .002 27p  - .99r 
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Ast  Kped 

p = - .0245c?  - 77.98  + .198p  + 5.27 r 

- 1.8436.  + . 6086D 

Ast  Rst 

r = .0127c?  + 22.16  - .344 p - 1.852r 
+.2016.  -.4  136p 

Mst  Kst 

In  addition  to  the  vehicle  dynamic  model,  the  vehicle  kinematics  are 
described  by  the  perturbation  relations 
• • 

p = $ - i-|  0 

• • 

q = e cos$^+  i-j  $ cos$-|  + $ sin$^ 

z.  • • 

v = -jj^  $sin$^  + $cos$-|  - 0 sm$.| 
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In  our  case,  solving  for  0,  <p,  and  <ji  yields 

<p  = p + .1610 

0 = -.16l4>  + *25<?  - . 968r 
\p  - .968 q + .25r 

Equations  C and  D now  constitute  the  final  vehicle  model. 
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. « 

Abstract 

Catalytic  flame  stabilization  utilizes  a solid  catalytic  surface 
to  stabilize  and  provide  a continuous  pilot  for  the  flame  propagation 
wave.  A test  program  is  being  set  up  to  obtain  and  compare  the  per- 
formance of  a J-85-5  turbojet  afterburner  using  a conventional  bluff 
body  flame-holder  and  several  different  catalyst  coated  and  uncoated 
ceramic  substrate  screens  for  flame-holders.  Substrate  materials  to 
be  used  are  Cordierite  and  Silicon  Carbide.  The  catalyst  coating's 
will  be  platinum  and  palladium.  Least  squares  mathematical  treatment 
and  integration  of  the  data  will  be  used  to  obtain  the  overall  emission 
concentrations  and  the  overall  combustion  efficiency.  It  is  hoped 
that  test  results  will  bear  out  modeling  predictions  for  an  increase, 
in  combustion  efficiency  and  decreased  pressure  drop  through  the 
flame-holder.  Preliminary  work  and  testing  thus  far  has  been  in  prep- 
aration for  establishing  base  line  performance.  Final  test  results 
should  establish  criteria  for  selection  of  both  substrate  and  catalytic 
materials  to  be  used  for  flame-holders  in  afterburners. 
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This  report  is  to  present  the  progress  to  date  for  an  on  going  research 
project  comparing  the  performance  of  the  J-85-5  afterburner  using  a conventional 
bluff  body  flameholder  and  several  different  catalyst  coated  and  uncoated 
substrate  screens  for  flameholders.  This  work  is  being  performed  as  part  of  the 
USAF-ASEE  summer  facility  research  program  in  the  Fuels  Branch,  Fuels  and 
Lubrication  Division,  of  the  Wright-Patterson  Air  Force  Aero  Propulsion 
Laboratory. 
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INTRODUCTION 

Various  studies  have  indicated  that  improvement  in  the  performance  of 
aircraft  afterburning  engines  can  be  expected  through  the  use  of  a catalytic 
flame  stabilization  device  in  the  afterburner.  Flameholder  modeling  studies 
have  indicated  that  am  increase  in  efficiency  and  am  improvement  in  the  percent 
pressure  loss  can  be  expected  through  the  use  of  a catalyst  coated  substate 
screen  flameholder. 

The  reseatrch  test  program  was  to  compare  results  using  a conventional 
V-shaped  bluff  body  flameholder  and  a honeycomb  structure,  catalytic  flame- 
holding  device.  The  designs  to  be  used  were  obtained  by  mathematically 
modeling  a flameholder  for  a J-85-5  afterburning  turbojet  engine.  The  overall 
purpose  of  this  experimental  program  is  to  determine  the  feasibility  of 
catalytic  flameholding  devices  to  stabilize  the  flame  in  aircraft  afterburners. 

To  accomplish  this  purpose  and  to  continue  effort  on  work  already  done, 
the  following  specific  objectives  were  set  up: 

1.  To  prepare  the  engine  and  data  acquisition  system  for  actual  testing: 
This  has  involved  instrument  trouble-shooting,  calibration,  and  measurement  and 
calibration  of  the  engine  nozzle  and  the  radial  probe  position  readout. 

2.  To  set  up  a test  program  and  procedure.  This  will  be  discussed  later  in 
this  report. 

3.  To  debug,  revise,  and  update  the  data  reduction  computer  program  which 
will  calculate  and  integrate  the  data  to  obtain  the  overall  combustion 
efficiency  and  the  overall  emission  concentrations.  The  program  for  calculations 
will  be  discussed  later  in  this  report. 

The  test  program  is  being  performed  using  a J-85-5  engine  at  the  Air  Force 
Aero  Propulsion  Laboratory . The  work  thus  far  has  been  mainly  preparation  for 
base  line  and  actual  testing.  The  engine  has  been  run  three  times  to  establish 
jet  stream  diameter  limits,  to  make  a hydrogen  check  and  velocity  profile,  and 
to  check  the  tape  data  handling  procedure.  The  next  step  is  to  establish  base 
line  performance  characteristics  using  the  conventional  bluff  body  flameholders . 
The  major  performance  advantages  expected  are  increased  afterburner  combustion 
efficiency  and  reduced  flameholder  pressure  drop. 
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APPARATUS 


The  J-85-5  jet  engine  has  an  eight  stage  compressor,  and  annular  com- 
bustion chamber,  a two  stage  turbine  and  an  afterburner.  The  afterburner 
consists  of  a diffuser  section  housing  the  flameholders , a cylindrical  double 
walled  annular  burning  section;  and  finally  a variable  exhaust  nozzle.  The 
major  engine  sections  are  shown  in  Figure  1. 

The  afterburner  consists  of  a straight  through  duct  equipped  with  a flame- 
holder  needed  to  stabilize  the  reacting  fuel/air  mixture.  The  afterburner's 
function  is  to  augment  the  maximum  thrust  that  can  be  produced  by  the  main  engine. 
Due  to  high  gas  temperatures  causing  dissociation  of  H2O  and  CO2  and  short 
residence  time,  afterburners  do  not  operate  at  combustion  efficiencies  as  high  as 
in  the  main  burner.  It  is  hoped  that  through  the  use  of  a catalytic  flame- 
holding device,  faster  reaction  rates  and  thus  higher  efficiencies  can  be  achieved. 
This  may  also  result  in  being  able  to  use  a shorter  length  of  afterburner.  The 
afterburner  assembly  is  shown  in  Figure  2.  The  diffuser  components  showing  the 
fuel  system  and  the  existing  bluff  body  flameholder  is  shown  in  Figure  3. 
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MAJOR  ENGINE  SECTIONS 
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AFTERBURNER  ASSEMBLY 


PILOT  SPRAYBAR  <4| 


DIFFUSER  COMPONENTS 


C- STAND  FACILITY 


A brief  description  of  the  facility  and  instrumentation  used  in  conducting 
this  study  will  be  presented  in  this  section. 

The  facility  consists  of  the  engine,  control  system,  gas  sampling  system 
and  a data  acquisition  system.  The  engine  is  started  and  controlled  from  the 
control  room  using  a typical  remote  engine  control  panel  as  shown  in  Figure  4. 

The  gas  sampling  draws  a continuous  sample  of  exhaust  gases  by  means  of  a 
movable  single  element  probe.  This  probe  has  freedom  of  movement  in  both  the 
horizontal  and  vertical  plane.  Probe  positioning  is  accomplished  by  means  of  two 
remotely  controlled  stepping  motors,  one  providing  horizontal  motion,  the  other 
providing  angular  displacement  in  the  vertical  direction.  Probe  position  is 
displayed  on  a digital  voltmeter  on  the  engine  control  panel.  The  temperature  of 
the  sample  is  maintained  at  about  300 °F  by  means  of  hot  water  circulating  through 
the  probe  assembly  and  electric  heating  of  the  line  back  to  the  control  room. 

The  gas  analysis  instruments  measure  CO,  THC (total  hydrocarbons),  CO2  and  H2. 
Calibration  is  accomplished  by  using  calibration  gases  of  known  concentrations. 

The  CO  and  CO2  are  measured  with  a Beckman  non-dispersive  infrared  instrument. 

The  total  hydrocarbons  are  measured  with  a Beckman  Flame  Ionization  Detector  (FID) 
type  instruemnt.  The  hydrogen  is  measured  with  a Beckman  Model  6700  Gas 
Chromatograph.  The  Emission  Instrument  set  up  is  shown  in  Figure  5.  Schematics 
of  the  gas  analysis  sampling  and  data  handling  are  shown  in  Figures  6 and  7. 

The  data  acquisition  system  consists  of  test  parameter  transducers,  signal 
conditioners,  amplifiers,  panel  digital  readout,  and  means  for  recording  data  on 
magnetic  tape.  The  system  provides  for  one  voltage  reading  per  channel  and  has  a 
capacity  of  100  channels  for  data  acquisition.  Twenty  eight  channels  will  be 
used  to  obtain  data  for  this  research  study.  All  test  parameters  inputs  are 
conditioned  to  a ten  volt  DC  scale  by  a bank  of  amplifiers . Each  parameter  has 
its  own  transducer  pick  up  and  individual  amplifier.  The  signals  are  read  out 
on  a common  modular  panel  digital  readout.  The  28  channels  may  may  be  called 
individually  or  set  so  that  the  instrument  will  run  a scan  on  all  active  channels 
consecutively.  The  scan  rate  can  be  varied  from  1 channel  per  second  up  to  15 
channels  per  second.  For  this  project  the  scan  rate  will  be  set  at  2 channels 
per  second.  A schematic  of  the  data  acquisition  system  is  shown  in  Figure  8. 
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FLAMEHOLDER  DESIGN 


The  overall  purpose  of  a flameholding  device  is  to  slow  down  the  velocity 
of  the  exhaust  jet  stream  entering  the  afterburner  in  the  vicinity  of  the  fuel 
supply  so  that  a flame  front  may  be  established  and  remain  stable.  If  the  velocity 
of  the  stream  is  faster  than  the  rate  of  flame  propagation  the  stream  will  literally 
blow  the  flame  out  of  the  exit.  If  slower  the  flame  front  will  try  to  propagate 
back  upstream  and  blow  itself  out  for  lack  of  fuel.  The  problem  then  is  to  provide  a 
device  that  will  stabilize  the  overall  combustion  processes  by  providing  a 
continuous  pilot  for  the  flame  propagation  wave. 

The  disadvantages  of  the  bluff  body  flameholders  are  as  follows : 

1.  High  stream  blockage  with  attendent  high  pressure  drop  across  the 
f lameholder . 

2.  No  means  of  modulating  the  f lameholder  for  various  power  settings. 

3.  Overheating  of  the  f lameholder  may  become  a problem  with  the  trend 
toward  higher  turbine  inlet  temperature. 

The  use  of  ceramic  substate  materials  with  a catalytic  coating  for  flameholding 
should  alleviate  some  of  the  above  problems.  Recent  automotive  emission  requirements 
have  provided  the  incentive  for  the  production  of  various  ceramic  substate  materials 
that  are  durable  at  high  temperatures.  High  temperature  ceramic  materials  that  sure 
currently  available  are: 

Cordierite  (2MgO  - 2AL203  - 5Si02) 

Silicon  Carbide  (SiC) 

Alumina  (AL2O3) 

Zirconia  (Zr02) 

The  materials  selected  for  the  J-85  flameholder  study  were  Cordierite  and 
silicon  carbide.  Of  the  four  listed  above  cordierite  has  the  highest  strength  and 
shock  resistance  with  silicon  carbide  having  the  lowest  strength  and  shock  resistance. 
Thus  these  two  materials  will  provide  data  at  opposite  extremes. 

Catalyst  surface  coating  materials  that  have  been  effective  in  promoting  com- 
bustion and  minimizing  emissions  include:  platinium,  palladuim,  iriduim,  cobolt  oxide, 
and  chromia.  Platinum  and  palladuim,  materials  used  in  automotive  catalytic  con- 
verters, have  been  selected  as  the  coating  materials  for  the  flameholder  tests.  Several 
different  sizes  (inside  and  outside  diameter)  and  thicknesses  will  be  tested.  The 
catalytic  flameholder  construction  to  be  used  is  shown  in  Figure  9. 


Combustion  research  with  catalysts  has  already  established  that  for  best 

operation  the  fuel  should  be  completely  vaporized  and  that  high  temperature 

operation  produces  the  best  results.  For  the  afterburner  application  inlet 

temperatures  axe  quite  high,  however  the  catalyst  may  be  subjected  to  variable 

rich  or  lean  mixtures.  It  is  hoped  that  the  testing  will  establish  criteria  for 

•c 

the  selection  of  both  substate  and  catalytic  materials  to  be  used  in  an  afterburner 
application . 

The  following  table  shows  the  various  types  and  sizes  that  will  be  used  in  the 
test  program. 


J-85-5  AFTERBURNER  FLAME  HOLDER 
CONFIGURATION  DESIGN  AND  SIZES 


DESIGN 

SIZE (INCHES) 

CATALYST 

SUBSTRATE  MATERIAL 

QUANTITY 

A 

8.4  x 14  X 1 

yes 

Cordierite 

1 

A 

8.4  x 14  x 1 

no 

Cordierite  * 

1 

B 

8 x 14  x 2 

no 

Cordierite 

2 

B 

8 x 14  x 2 

yes 

Cordierite 

2 

C 

7.6  x 14.6  x 1 

yes 

Cordierite 

1 

D 

7.6  x 14.6  x 2 

yes 

Cordierite 

2 

E 

5.9  x 14.8  x 1 

yes 

Cordierite 

1 

F 

5.9  x 14.8  x 2 

yes 

Cordierite 

2 

B 

8 x 14  x 2 

no 

Silicon  Carbide 

3 

B 

8 x 14  x 2 

yes 

Silicon  Carbide 

3 

D 

7.6  x 14.6  x 2 

yes 

Silicon  Carbide 

3 
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COMBUSTION  OF  A HYDROCARBON  FUEL  WITH  AIR 

Dry  air  is  a mixture  of  gases  with  the  following  volumetric  analysis  in 
per  cent: 

02  - 20.99 

N2  - 78.03 

A - 0.94 

C02  - 0.03 

H2  - 0.01 

Argon  listed  in  the  above  includes  traces  of  neon,  helium  and  krypton. 

For  most  calculations  dry  air  can  be  considered  as  21  percent  by  volume 

oxygen  and  79  percent  nitrogen.  Moisture  content  varies  widely  depending 

upon  local  conditions.  The  moisture  content  will  add  am  additional 

amount  of  essentially  inert  material. 

These  calculations  will  consider  the  following: 

.21  moles  of  02  + .79  moles  N2  = 1 mole  air 

Ma  - molecular  weight  of  air 

Mf  - molecular  weight  of  fuel 

Mn  - molecular  weight  of  nitrogen 

Ma  = 28.964 

Mn  = 28.161  (This  is  weighted  to  include  all  of  the  inert  gases) 

Humidity  will  be  considered  negligible. 

Fuel  will  be  considered  as  C^  HHCR. 

where:  HCR  - hydrogen/carbon  ratio  in  the  fuel, 
and  Mf  - 12  + HCR 

The  combining  equation,  with  most  of  the  gaseous  constituents  expected 
in  the  exhaust  from  a turbojet  engine,  on  a mole  of  air  basis  is  as  follows: 


15 


— 


*« 


FAR  C HhCR  + .2102  + -79N2  = a CO2  + b H2O  + c CO  + d H2  + e CH4  + f O2  + .79N2 
where:  FAR  - fuel  air  ratio  by  mass 

HCR  - hydrogen  - carbon  ratio  in  the  fuel 

a,b,c,d,e,f-  mole  volumes  of  CO2,  H2O,  CO,  H2,  CH4  and  O2 
respectfully. 

Unbumed  hydrocarbons  are  considered  as  CH4. 

Balancing  the  atoms  of  each  constituent: 


C: 

(FaE1  57 

= a + c + e 

(1) 

H: 

(HCR)  = 2b  + 2d  + 4e 

(2) 

0: 

.21(2)  - 

2a  + b + c + 2f 

(3) 

MT 

= a + b + 

c+d+e+f  .79 

(4) 

where:  MT  - Total  moles  of  exhaust  per  mole  of  air 
The  exhaust  constituents  that  are  measured  are  as  follows: 

CO2  - in  percent  by  volume  on  a dry  basis 

CO  - ppm  by  volume  on  a dry  basis 

H2  “ ppm  by  volume  on  a wet  basis 

CH4  - ppm  by  volume  on  a wet  basis 

H2O,  and  02  are  not  measured.  Considering  that  generally  for  any  particular 
fuel  HCR  is  known,  examination  of  the  above  four  equations  reveals  four  unknowns 
namely  FAR,  b,  f,  and  MT. 

To  solve  the  above  four  equations  rewrite  in  terms  of  the  constituents  as 
measured . 

a = £22.  (mt  - b) 

100 

c " §6  (MT  “ b> 
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d «■  gfc  (MT) 
e = WT) 

N.  B.  Only  CO2  and  CO  taken  on  a dry  basis. 

Rewriting  the  atom  balance  in  terms  of  the  constituents  as  measured. 

C:  (FAR)  £— ■ = (MT  - b)  + ^5  (MT  - b)  + (MT)  (5) 

H:  (FAR)  ^ (HCR)  - 2b  + (MT)  + 4CH4  (MT)  (6) 

Mf  10  "lOb 

O:  .21(2)  = ^22.  (mt  - b)  + b + (MT  - b)  + 2f  (7) 

MT  = TOO  (MT  ' b)  + b + X§6  (MT  - b)  + gg  (MT)  + MT  + f + ’79  (8) 

Express  b = f (MT,  HCR,  FAR)  from  the  hydrogen  balance  equation  (6) 

2J)  - (FAB)  (HCR)  jjA.  _ 2S£  (MT)  . 1S|  (ht) 


fc-S.  « - 


Express  f = function  (FAR,  MT)  from  the  oxygen  balance  equation  (7) 
substitute  the  above  value  for  b 


2 CO 9 . . 

•42  = loo  (MT) 


_ 2C02  r (FAR).(JCR)-  Ma  . H2  (MT)  . 2CH, 

inn  1 9 Mf  105  1 no  ' ' J 


(FAR)  (HCR) 


Sf-5.  ^ (mt,  - & , s*  -a,  « 


(MT)  J + 2f 


solving  for  f,  number  of  moles  of  O2 

f - .21  ♦ (A,  <SS)  ffisisaai  + ,B)  «| 


where-  A = 222.  +.  22 _ 1 

wnere . a 100  + 2 x 10b  j 
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n _ Si.  a.  t2)CHA  _ C2)C02  2(C0?)  H2  _ C4)  CC02)  (CH4)  CO  (CO)  (H?)  (2)  (CO)  (CH„ ) 

106  106  100  IP  ' Ip2  " 136 1^2  I5T5 


Note:  A,  B,  C,  D etc.  will  refer  to  groups  of  quantities  obtained 
from  test  data 

Express  MT  ■ f (b,  MT)  using  the  carbon  balance  equation  (5) 

5?-  ‘15^156  + gf  1 «-  !?&♦§*;  » 

substitute  b obtained  from  the  hydrogen  balance 

(FAR)  ^ = [COi.CO  Oh  C02.  M^lHCR )_  CO^  x H2  , 

Mf  1 100  10 6 lP  J 1 100  Mf  2 J ™ + 1 100  lO6  J 


100  10fc 


Rearranging  and  solving  for  FAR 


(C)  M*  (MT) 

FAR  - ToTMa 


Fuel  air  ratio 


(2)  (CO)  (CHa) 

— 1512 — 


„ _ , . (HCR)C02  . (HCR)CO 

D 1 + — i3o  + mp 


Now  solve  the  above  for  MT  (Total  moles  of  exhaust  per  mole  of  air) 

MT  = <*&)<?)  fiki  (9) 

(C)  (Mf ) t9) 

Substitute  MT,  b,  and  f in  equation  (S)  and  solve  for  FAR;  first  rearrange 
factoring  out  MT  and  b 

1 1 - m - & - §>  - 1 - j&-  & 1 1.  * * ♦ .79 


Now  substitute  MT,  b,  and  f in  the  above 
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r i ££2.  22  S2.  CHd 

1 100  10®  " 10®  " To®  J 


“2. 

~ 10®  x 


(C) (Mf ) 


(FAR)  (D)  (Ma) 
(C) (Mf) 

(D)  (FAR)  (M, 
(C)  (Mf) 


. . CO?  CO  . . 
1 1 ~ 100  ' 10®  ] 1 


] + .21 


(FAR) (HCR) 


M-  (FAR)  (HCR)  (D)  (FAR)  (Ma) 

W 2 +B  (2 ) (C)  (Mf ) + >79 

Grouping  terms  and  solving  for  FAR  (fuel  air  ratio) 


FAR  = (n^-) 


Ma  , „ C02  CO  H2  CHa.  D (B)  (D)  . (E)  (H5)  (D)  . 2CH4 

1 (1  ■ loo  lo®  ■ lo  ■ Iot}  c ■ JITW  + (C)  io*~  + (E)  -T$  x 


A (HCR)  E (HCR) 


. „ , C02  CO 

where:  E = 1 - 10Q  - 1q6 


Substitute  A,  B,  C,  D,  and  E,  cancel  terms  where  possible  and  reduce  the  above 
to  the  following.  Note  that  terms  A,  B,  and  E no  longer  appear,  however 
a new  group  of  terms  called  F appears. 


EME-aiQ] 


(HCR) (C) 


10®  10 
(HCR) (C02) 


l - 32. 

2x106 


CO 

2x10® 


(HCR)  (CO) 

2 x 10^ 

_ (CO)  (Ho) 
2X101* 


For  convenience  in  checking  results  while  setting  up  the  main  program  both 
FAR (10)  and  MT(9)  were  programed  and  put  on  file.  These  two  programs  cure 
presented  in  the  appendix.  MT  or  total  moles  is  referred  to  as  program 
DELTA.  The  fuel  air  ratio  is  referred  to  as  program  FAR. 


TEST  PROGRAM 


The  objective  of  the  test  program  is  to  obtain  and  compare  the  performance 
of  the  J-85  afterburner  using  a conventional  bluff  body  flameholder  and  several 
different  catalyst  coated  and  uncoated  substrate  screens  for  f lameholders . The 
desired  performance  results  are  combustion  efficiency  and  flameholder  pressure 
drop.  The  major  independent  variable  will  be  fuel  flew  (engine  power  setting) . 

There  will  be  four  different  power  settings  used  for  each  test  configuration. 

These  will  be  designated  as  military,  minimum  afterburner,  mid-afterburner  and 
maximum  afterburner.  The  independent  variable  at  each  power  setting  will  be  the 
exhaust  probe  position.  A traverse  will  be  made  across  the  diameter,  approximately 
eight  inches  from  the  engine  exhaust  nozzle.  There  will  be  eleven  probe  stations 
at  the  center  of  area  of  equal  concentric  areas  across  the  exhaust  plane. 

Since  it  is  virtually  impossible  to  accurately  obtain  exhaust  temperature 
measurements  a complex  mathematical  procedure  based  on  thermodynamic  theory  is  used 
to  obtain  the  exhaust  temperature.  This  procedure  being  developed  using  a computer 
program  will  be  summarized  as  follows: 

The  program  calculates  the  local  adiabatic  flame  temperature  assuming  100% 
combustion  efficiency.  Using  this  temperature  the  program  calculates  equilibrium 
constants  then  calculates  the  number  of  moles  of  each  exhaust  gas  constituent  at 
equilibrium  conditions.  The  program  then  takes  the  equilibrium  conditions  combined 
with  the  actual  emissions  and  calculates  the  local  actual  combustion  efficiency. 

Using  this  efficiency  the  program  again  uses  an  iterative  procedure,  as  used  to  get 
the  adiabatic  flame  temperature,  and  calculates  what  approaches  a true  total  exhaust 
temperature.  With  this  value  the  static  temperature,  density,  and  velocity  can  be 
calculated.  Now  local  data  is  reduced  using  a least  squares  polynomial  curve  fit, 
integrating  the  results,  normalized  using  the  measured  mass  flow  to  finally  obtain 
the  overall  emission  concentrations,  the  calculated  overall  mass  flow,  and  the 
overall  combustion  efficiency.  A check  on  the  accuracy  of  the  test  sampling  and  data 
reduction  can  be  made  by  comparing  the  measured  mass  flow  with  the  mass  flow  cal- 
culated from  the  emission  data. 


One  of  the  problems  in  obtaining  data  is  locating  the  edge  of  the  exhaust 
stream.  Generally  this  is  taken  as  the  radial  location  where  the  impact  pressure 
equals  the  ambient  pressure . However  due  to  local  turbulence  producing  vortices  at 
the  edge  of  the  jet  stream  the  aforementioned  method  may  not  accurately  locate  the 
edge  of  the  exhaust  stream.  Another  problem  is  in  calculating  the  overall  com- 
bustion efficiency  based  on  the  exhaust  emissions.  This  is  weighted  by  multiplying 
the  local  efficiency  by  the  local  mass  flow  and  integrating  the  result  across  the 
exit  area  then  dividing  by  the  measured  mass  flow . This  can  best  be  shown  by 
mathematical  equations  as  follows : 

(Mass  Flow)m  n 51  /*  Hj,  d (Mass  Flow)^ 
where : 

n - overall  efficiency 
- local  efficiency 
subscript  m - measured 
subscript  l - local 

d(Mass  Flow)^  = p(Vel)dA 
dA  = 2tt  rdr 

(Mass  Flow)  j,  = 2ir  p(yel)  rdr 

and 

(Mass  Flow)^  n =»  2it  (.nj,  P Vel)  rdr 

(nj,  p Vel)  may  be  expressed  as  a function  of  the  radius.  This  function  is  obtained 
using  a least  squares  polynomial  curve  fit.  Thus  the  final  expression  for  the 
overall  combustion  efficiency  based  on  sampling  exhaust  across  the  exit  area  is : 

2u  /I  Chi  p Vel)Cr)rdr 

n = - — 

(Mass  Flow)m 

where:  R - the  effective  exhaust  stream  radius 
p - density  in  lbm  per  cuft. 

Vel  - local  velocity  in  ft  per  sec 
r - radius  in  feet 
Mass  Flow  - in  pounds  per  second 
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It  can  easily  be  seen  from  the  above  expression  that  this  may  lead  to  a 
problem,  even  though  data  may  be  acceptable,  if  the  measured  mass  flow  was  higher 
than  the  integrated  mass  flow.  This  would  cause  the  overall  combustion  efficiency 
to  come  out  over  100  percent.  To  get  around  this  problem  and  also  to  more 
accurately  locate  the  edge  of  the  exhaust  stream  and  iterative  look  was  put  into 
the  calculation  program  that  causes  the  integration  to  proceed  to  a radius  that 
will  cause  the  measured  mass  flow  to  equal  the  integrated  mass  flow  and  thus  keep 
the  calculated  efficiencies  on  the  proper  side  of  100  percent.  ( As  far  as  we 

4 

know  this  method  is  unique  to  our  project  other  research  studies  have  generally  used 
some  arbitrary  method  in  establishing  the  edge  of  the  exhaust  stream) 

The  calculation  for  p Vel  plays  an  important  part  in  obtaining  the  calculated 
results.  Some  detail  showing  how  calculations  concerning  p Vel  are  handled  is 
given  in  the  appendix. 


DATA  PROCESSING 


Twenty-eight  test  parameters  are  measured  and  recorded  on  magnetic  tape  by  means 
of  the  data  processing  equipemnt.  Preparation  for  actual  testing  involved  obtaining 
a calibration  curve  for  each  channel  and  then  getting  a curve  fit  equation  so  that 
the  voltage  signal  recorded  on  tape  could  then  be  converted  to  the  actual  parameter 
value.  As  an  example  of  this  the  conversion  procedure  for  the  probe  position  and 
the  nozzle  position  indicator  are  shown  in  the  appendix.  Copies  of  the  calibration 
data  are  also  included. 

A program  was  written  to  read  the  engine  operating  and  emissions  data,  print 
the  data  values  and  finally  punch  cards  for  input  into  a subsequent  calculation 
program.  The  test  parameters  and  outline  format  for  fixed  data,  output,  data 
summary,  and  program  output  sure  shown  in  the  appendix. 
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' CHANNEL  NUMBERS 


VARIABLE 


OLD 

NEW 

PARAMETER 

NAME 

CALIBRATION  CURVE 

1 

1 ’ 

E.G.T. 

T5 

T5  = 45 . +900 . *V/4 . 2 

3 

2 

Inlet  Temp 

TO  ' 

TO  = 32 ,1-  50 . *V/1 .40 

5 

3 

Main  Fuel 

WFM 

WFM  = 1000. *V 

6 

4 

A-B  Fuel 

WFAB 

VIF/iB  = 1000.  *V 

7 

5 

NO/NOX 

NO 

NO  = 0.1 *V*RNO 

8 

6 

CO 

CO 

CO  = 0 . l*V*RCO 

9 

7 

T.H.C. 

THC 

THC  = 0.1*V*RTI1C 

10 

8 

co2 

C02 

co2  = o.i*v*rco2 

New 

9 

«2 

. 

h2 

h2  = 0.1*V*RH2 

21 

10 

X Probe  Pos 

XDIST 

XDIST  = -0. 003333*V  - i J 7 ' 

22 

11 

Y Probe  Pos 

ANGLE 

ANGLE  = 0v  17857 *V  * 7.  o 2 / ' 

2 

12 

NPI 

1 

r>  A8 

1 ,*  A8  = V*tr5-3-34+98.-585 

New 

13 

R.P.M. 

NG 

NG  = 16542.0  '•4rr0j£lV-  t/f  /_• 

11 

14 

Inlet  Diff. 

n 

DP 1(1) 

DP  1(1)  = V 

12 

15 

Inlet  Diff. 

n 

DP1(2) 

DPI (2)  ~ V 

13 

16 

Inlet  Diff. 

4f3 

DP1(3) 

DP  1(3)  = V 

14 

17 

Inlet  Diff. 

#4 

DP  1(4) 

DP  1(4)  = V 

15 

18 

Inlet  Total 

(fi 

PTl(l) 

PT  1(1)  = 10. +V 

16 

19 

Inlet  Total 

#2 

PT1(2) 

PT  1(2)  = 10. +V 

17 

20 

Inlet  Total 

#3 

PT1(3) 

PT1(3)  = 10.  +V 

18 

21 

Inlet  Total 

H 

PTl(4) 

PT1(4)  - 10. +V 

4 

22 

Thrust 

FG 

FG  >=  1000. *V 

20 

23 

Probe  Tot.  1 

Press 

. PT8 

PT3  = 5 . * V 

19 

24 

New  Press. 

(AMB) 

PO 

PO  = 10.  -tv 

Hew 

25 

New  Press. 

(FLH) 

PT5 (1) 

PT5  (1)  = 10. *V 

New 

26 

Hew  Press. 

(FLK) 

PT5 (2) 

PT5 (2)  = 10. *V 

New 

27 

New  Press. 

(FLH) 

PT5 (3) 

PT5 (3)  = 10. *7 

New 

28 

New  Press. 

(FLH) 

PT5 (4) 

PT5 (4)  = 10. *V 

NOTE:  The  ranges 

for  emission 

instruments.  RNO, 

RN02,  etc.  change  with  operating 

• conditions. 

The  table  below 

indicates  the 

instrument  range  fer  engine 

operating  condition  being  run. 

OPERATING  CONDITION  RNO 

RCO 

RTHC  RC02  RH2 

MILITARY 

• 

. 50 

2000 

50  10  500 

MIN  A/B 

50 

20000 

10000  10  5000 

MID  A/B 

100 

20000 

. 5000  # 5000C 

MAX  A/D 

100 

’ 20000 

2500  tf  50000 

(I  002  ■>  . 00072GB  + .5457  * V + .20701  * V *'*  2 


- 1.036033  * V * * 3 + .0043749  * V * * 4 

- .00014267  * V * * 5 


7 'iS  x l.  \ *=  5 — 


C i 7ri  v i.  *-£  i-  c-. 
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/ C 


Fixed  Data  Format 


Date 

Engine  Condition  Code 

Configuration  Code 

Data 

Point 

xxxxxx 

XXX 

XXX  XXX  XXX 

XXX 

XXX 

Date  Code 

Day:  01  - 31 
Month:  01  - 12 
Year:  78  - 80 


Engine  Condition  Code 


Military  Condition: 
Min  A/B  Condition: 
Mid  A/B  Condition: 
Max  A/B  Condition: 


000 

100  - 250 
250  - 400 
410  - 600 


Configuration  Code  (per  cat,  disk) 

Design  Type:  . 0 - Conventional  Hardware 

. 1 - Design  A 

2 - Design  3 

3 - Design  C 

4 - Design  D 

5 - Design  E 

6 - Design  F 

7 - No  Disk 


Substrate  Material:  1 - Cordc-rito 

• 2 - Silicon  Carbide 

Catalyst  Coating:  0 - No  coating 

1 - Pt/Pd  coating 


Data  Point  Code 
Data  point:  000  - 100 

U • 

Subpoint:  000  - 050 


Hie  format  on  the  preceding  page  shall  be  used  for  subpoints 
2 thru  n.  Following  subpoint  n,  an  overall  data  summary  shall  be 
presented  in  the  format  shown  on  the  succeeding  page. 

Following  the  data  summary  output  a set  of  summary  data  cards 
shall  be  punched.  The  format  shall  be  as  shown  below: 

* ' V 

PUNCH  1000,  DATAPT,  T5,  TO,  WFM,  WFAB,  A8,  NG,  DPl(l),  DPI (2),  DPI (3), 
DPI (4) 

1000  FORMAT  (13,  F7.0,  F7.1,  F7.0,  F7.0,,F7.0,  F7.4,  F7.4,  F7.4,  F7.4)  ' 

4 

PUNCH  1001,  DATAPT,  PT1 (1) , PTl (2) , PT1(3),  PT1(4),  FG,  PO,  PT5(1), 
PT5C2),  PT5  (3)  , PT5  (4 ) 

1001  FORMAT  (13,  F7.4,  F7.4,  F7.4,  F7..4  F7.0,  F7.4,  F7.4,  F7.4,  F7.4, 

F7.4) 

PUNCH  1002 . SUBPT,  RAD,  CO,  Co2,  THC,  H2,  NO,  IJC2 

FORMAT  1002  (16,  F7.4,  F7.0,  F7.2,  F7.1,  F7.1,  F7.1,  F7.1} 


y > c. 1 • 


!' 


4 


* 


^6vek 


It 


OVERALL  DATA  SUMMARY 
^/B  CATALYTIC  FLAME  STABILIZATION  J85-5 


""JUNE  21,  1978 

/Operating  condition  - max  a/b 


V 


Disk  1: 


Design  C 
Corderite 
Pt/Pd  Catalyst 


Disk  2: 


Design  D 
Corderite 
Pt/Pd  Catalyst 


Disk  3:  No  Disk 


to 


SiANNEL 

QUANTITY  AVE  V 

1 

T5 

2 

TO 

3 

WFM 



WFAB 

A8 

13 

HG 

14 

DPI (1) 

15 

DPI  (2) 

16 

DPI (3) 

17 

DPI (4) 

18 

PT1(1) 

19 

PT1 ( 2 ) 

20 

PT1 (3) 

21 

PT1 (4) 

22 

FG 

24 

PO 

25 

PT5 (1) 

26 

PT5 (2) 

27 

PT5 (3)  . 

28 

PT5 (4)  | 

DATA  POINT  001 

MAX 


Subpoint.  Radial  Position 


VOLTAGE  AVE  MAX 


EMISSIONS  DATA 


i 


MIN 


Ac 


°l  [0 
H2  NO 


f 

T 


1 

2 


6.77in 
6 .50in 


25.14psia 

26.00psia 


940ppm 

920Ppm 


9.60% 

9.20% 


50ppm  25ppm  5ppa» 
50ppm  20ppm  5ppra 


If 


(/ 

K02 

lOppra 

lppta 
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Si  S_ 

i i 

SS  *s 

ii  SSSSSiSS 

i%.  y 

i SS  i 

Hit 

SS  Ji 

S3  5 ii 

S3 

Hit  3. 

SS  Si 

S i ♦ 4 

« $3  S Si 

SSSiS 

i I i 

Si  iS 

SS 

Sz  SS  SSSSSii 

S$ 

iiiS  £ 

i £ i i 

S Six 

i S S S i 5 

S3  S 

i iS  ?. 

SiSiSiSS 

4 S i i 

SSij  S3  Si  Si 

if  * S3 

>4/  V 

® I £w 

Si  Si 

SS  SS 

SS  Si 

Si  ?- 

Si  SS 

Si  Si 

ii  SS  Si 

SSSSSSii  Si 

ii  3 

SSSiiS 

Si  iS 

SS  Si 

SS  SSiiiS 

INDEX  X 

Y 

1 

.64344 

10.9s 

CO 

2 

2 .24420 

11.5 

uJ 

3 

3.909*14 

12. 19 

Pu 

4 

4.93270 

12.75 

0 6 

5 

5 .76  3 4'J 

13.  3i 

o J 

6 

6 . 5G w 3 u 
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0 J 

( 
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00 
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Abstract 

Electrochemical  impregnation  is  a promising  method  to  produce  cadmium 
electrodes  for  nickel  cadmium  batteries.  Due  to  the  complicated  chemical 
and  physical  processes  that  are  involved  in  the  impregnation  of  the  cadmium 
materials  into  the  interstices  of  the  porous  nickel  plaque  material,  wide- 
varying  operating  conditions  has  been  reported  in  the  literature.  This  work 
is  an  effort  to  understand  the  electrochemical  reactions  which  occurr  during 
the  impregnation  process  via  cyclic  voltammetry  on  nickel  and  cadmium  elec- 
trodes in  cadmium  nitrate  solutions.  The  reactions  identified,  for  experi- 
ments on  both  nickel  and  cadmium  electrodes, are  the  deposition  of  cadmium 
hydroxide,  the  reduction  of  cadmium  hydroxide  and  the  reduction  of  cadmium 
ions  to  cadmuim.  It  is  believed  that  the  later  reaction  occurs  only  on  the 
cadmium  surface.  The  voltammetric  data  on  the  nickel  electrode  indicats 
that  a passivating  film  is  deposited  on  the  nickel  electrode.  This  species 
is  believed  to  be  cadmium  oxide  and  could  occur  as  the  reduction  reaction: 

Cd++  + H30+  + 3e“->  CdO  + 3/2  H2 

Nickel,  by  its  nature  is  a good  catalyst  for  the  hydrogen  evolution  reaction, 
may  catalyze  the  above  reaction.  It  appears  that  the  formation  of  this 
passive  film  could  be  deleterious  to  the  Air  Force  electrochemical 
impregnation  process  for  the  manufacturing  of  cadmium  electrode. 


FOREWORD 


This  report  presents  the  results  of  cyclic  voltammetric  studies  of  the 
reactions  that  occur  during  the  electrochemical  impregnation  of  cadmium 
electrodes.  Dr.  Yuen-Koh  Kao,  assistant  Professor  of  Chemical  Engineering 
at  the  University  of  Cincinnati  was  the  principal  investigator.  This  work 
was  conducted  during  the  summer  of  1978  when  he  was  a Faculty  Fellow  at 
the  Ohio  State  University  in  the  ASEE-USAF  Summer  Faculty  Program  (WPAFB). 
Acknowledgements  are  due  to  Dr.  Joseph  T.  Maloy  (AFAPL/Senior  Investigator) 
for  setting  up  the  experimental  apparatus  and  many  helpful  discussions, to 
Dr.  John  J.  Lander  AFAPL/POE-1  for  his  suggestions.  Technician  support  was 
provided  by  Mr. John  Leonard  of  the  Aerospace  Power  Technical  Support  Branch 
(AFAPL/TFP) . 


Prepared 


Reviewed 


by. 


■ y 

H l' 

(t~U 

Yuen^j 

Koh  Ka 

V 

0 

by. 


qmm 


R. A.  MARSH 


INTRODUCTION 


Various  methods  of  manufacturing  cadmium  electrodes  exist  in  the 
patent  literature.  These  methods  can  be  classified  into  three  categories  (1): 
(a)  impregnated  nickel  sinter  type,  (b)  pressed  powder  type  and  (c)  electro- 
chemical impregnation.  The  last  method  is  most  attractive  because  it  is 
simple  in  operation  and  can  in  theory  achieve  high  loading  of  active  material 
in  a single  impregnation  cycle  at  lowest  cost. 

In  the  electrochemical  disposition  process,  nickel  plaque  material  serves 
as  a cathode  in  an  electrolysis  cell.  The  cadmium  or  cadmium  hydroxide 
is  deposited  onto  the  substrate  from  a cadmium  nitrate  solution.  This 
deposition  of  Cd(0H)o  may  be  caused  by  the  large  amount  of  hydroxide  ion 
near  the  electrode  through  either  the  direct  liberation  of  H2  at  the  electrode: 

2 H2O  + 2e  ' — * H2  + 2 OH  " 
or  the  reduction  of  nitrate  ion: 

H2O  + N03  + 2e;  N02  + 2 0H~ 

• • • • • 

6%H20  + NO3  + 8e-*NH3  + 90H~ 

The  deposited  cadmium  hydroxide  may  then  be  reduced  to  cadmium  metal  by  the 
following  reaction: 

Cd  (OH) 2 + 2e~  Cd  + 20H_ 

If  pH  near  the  electrode  is  high  enough  to  favor  the  existance  of  cadmite  ion, 
it  can  be  reduced  to  cadmium  by  the  following  reaction: 

Cd  (0H)3%  2e"  — ► Cd  + 30H" 

Cadmium  ions  can  also  be  reduced  directly  to  cadmium  metal: 

Cd+++  2e”  — •»  Cd 

Under  suitable  potentiostatic  condition,  one  or  more  of  the  above  reactions 
could  occur  during  the  impregnation  process. 

The  success  of  electrochemical  impregnation  method  has  been  achieved 
under  wide-varying  operating  conditions.  There  are  several  variables  in  the 
electrochemical  impregnation  process.  Beauchamp  (2)  in  his  patent  emphasizes 
the  fact  that  high  impregnation  bath  temperature  keeps  the  size  of  cadmium 
hydroxide  crystals  small  and  thereby  obtaining  an  active  electrode.  The 
pH  of  the  impregnation  solution  is  another  controlling  factor.  In  Pickett's 
patent  held  by  Air  Force  (3),  the  pH  of  the  cadmium  nitrate  bath  is  maintained 
between  3-5  by  adding  proper  amount  of  nitric  acid.  In  Beauchamp's  process,  the 
pH  is  controlled  by  "buffering"  the  cadmium  nitrate  solution  with  sodium 
nitrite.  The  current  density,  based  on  apparent  electrode  area  is  another 
important  controlling  parameter,  varies  from  4-8  amps/nr  in  Beauchamp's 
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process  (2)  to  as  high  as  1.2-1. 6 amps/in2  in  the  Pickett  process  (3). 

Bulen  in  his  patented  process  (4)  claims  that  the  electrode  impregnated  by 
alternate  current  technique  retains  much  of  its  capacity  after  cycling  as 
compared  to  electrode  fabricated  by  other  methods. 

The  electrochemical  impregnation  of  cadmium  into  the  interstices 
of  a porous  substrate  is  very  complicated  involving  both  chemical  and  physical 
processes.  The  diversity  of  methods  in  the  patent  literature  suggests  that 
further  improvement  of  the  state-of-the-art  of  the  said  process  is  not 
possible  without  further  understanding  of  the  related  electrochemical  reactions. 
The  understanding  of  these  electrochemical  reactions  shall  provide  a baseline 
whereupon  further  optimization  and  refinement  of  the  manufacturing  process  can 
be  attempted. 

The  electrochemistry  of  the  Cd/Cd( OH) 2 electrode  in  concentrated  alkaline 
solution,  typically  KOH  as  in  Ni/Cd  battery,  has  been  studied  extensively.  (5,6) 
There  is,  however,  little  information  available  about  the  electrochemical 
reaction  in  cadmium  nitrate  solutions. 

OBJECTIVE 

It  is  the  objective  of  this  study  to  investigate  electrochemical  reactions 
of  Cd/Cd(0H)2  deposition  as  occurring  on  cadmium  and  nickel  electrode  surfaces 
in  CdtNOjU  solutions,  especially  the  role  that  the  nickel  plays  in  the  deposition 
process.  Also  investigated  is  the  effect  of  pH  to  the  electrodeposition 
process  in  the  fabrication  of  cadmium  electrodes. 

EXPERIMENTAL 

Cyclic  voltametry  experiments  were  conducted  with  a Princeton  Applied 
REsearch  Model  170  Electrochemical  System.  The  nickel  and  cadmium  micro- 
electrode  were  constructed  in  house  at  AFAPL.  The  nickel  electrode  was 
constructed  by  forcing  a 0.05  inch  diameter  nickel  wire  through  a Teflon 
sleeve  and  grinding  the  end  flat.  The  cadmium  electrode  was  constructed 
by  encasing  a cadmium  strip  in  a lucite  rod  and  again  grinding  the  end  flat. 

The  square  cadmium  electrode  had  an  area  of  0.01  in2.  The  reference  electrode, 
also  made  in  house  at  AFAPL,  was  an  aqueous  saturated  colonel  electrode 
fitted  with  a side  arm  salt  bridge  with  a pin-hole  junction  at  the  end. 

The  counter  electrode  was  a strip  of  bare  cadmium  metal. 

All  experiments  were  conducted  in  a covered  400  ml  beaker.  The 
electrolyte  was  dearea ted  and  held  under  helium  atmosphere.  The  electrolyte 
was  either  2M  Cd(N03)2  or  1M  KNO3.  The  pH  of  the  solution  is  controlled  by 
adding  small  amount  of  nitric  acid.  All  experiments  were  conducted  in  a 
quiscient  solution. 

For  each  run,  the  electrode  was  polished  with  4/0  energy  paper  immediately 
before  use  to  assure  a fresh  metal  surface.  The  electrode  is  then  rinsed 
with  distilled  water  before  it  was  put  into  the  cell. 


RESULT  AND  DISCUSSION 


To  understand  the  role  of  nitrate  ions  in  the  deposition  process,  both 
nickel  and  cadmium  microelectrode  were  subjected  to  cyclic  voltammetry  in 
the  KNO-j  solution.  The  results-  are  shown  in  Figures  1 and  2.  There  is  a 
noted  absence  of  any  detectable  faradic  process  in  the  range  -0.4  to  -0.8 
vs.  SCE. 

Figure  ( 1 . a ) is  a voltammogram  of  nickel  electrode  in  2M  Cd(N03)2  and 
1M  KNO3  solutions  at  pH  = 2.6.  The  scan  covers  range  between  -0.4  to  -0.8V 
vs.  SCE,  well  into  the  cathodic  background  of  nickel  electrode.  There  is  a 
significant  faradic  process  occurs  on  the  initial  scan  out  to  the  cathodic 
direction.  The  potential  where  this  process  occurs  varies  with  pH,  from 
-0.624V  vs.  SCE  at  pH  = 1.5  to  -0.678V  vs.  SCE  at  pH  = 6.0.  This  wave  appears 
only  at  the  initial  scan  and  causes  the  passivasion  of  the  electrode  surface. 

The  surface  species  is  believed  to  be  cadmium  oxide.  More  on  this  will  be 
discussed  below. 

On  the  return  scan,  another  faradic  process  occurs  at  -0.63V  vs.  SCE. 

This  is  believed  to  be  the  reduction  of  cadmium  hydroxide  to  cadmium  metal 
as  was  indicated  by  equation  (3).  This  cadmium  hydroxide  is  presumably 
deposited  onto  the  electrode  at  negative  potentials  when  large  amounts  of 
hydroxide  ions  are  generated  near  the  electrode.  The  cathodic  background 
process  could  either  be  the  reduction  of  nitrate  ion  and, its  product  as 
indicated  by  equation  (2)  or  the  reduction  of  water  evolving  hydrogen  as 
indicated  by  equatior  (1).  When  the  background  hydroxide  ion  generation  reaction 
is  reduced  by  sweeping  less  into  the  cathodic  background  as  was  done  in  the 
next  experiment  in  wnich  the  sweep  range  was  between  -0.4  and  -0.7V  vs.  SCE 
as  shown  in  Figure  (3.b).  There  is  not  hydroxide  ions  generated,  so  cadmuim 
hydroxide  is  not  precipitated  and  consequently,  there  is  no  current  due  to  the 
reduction  of  cadmium  hydroxide  on  the  return  scan.  In  addition,  the  cadmium 
oxide  film  cannot  be  stripped  away  and  the  electrode  is  passivated. 

Figure  (2. a)  shows  the  multiple  cyclic  voltammogram  of  cadmium  electrode 
in  the  same  solution,  2M  Cd  (NO3)?  and  1M  KNO3,  at  various  pH  = 2.6.  The  range 
of  sweep  was  between  -0.6  to  -0.8V  vs.  SCE.  The  narrow  scan  range  was  chosen 
to  prevent  significant  dissolution  of  cadmium  electrode  which  would  occur  if 
the  electrode  were  subjected  to  potential  much  more  anodic  than  the  rest 
potential,  about  -0.61V  vs.  SCE.  There  is  a distinct  difference  between  the 
vol tammograms  of  the  cadmium  electrode  and  the  nickel  electrode  shown  in 
Figure  1.  The  faradic  process  that  occurred  on  the  initial  scan  of  the  nickel 
electrode  that  passivated  the  electrode  is  absent  in  the  cadmium  electrode. 

There  are  three  cathodic  processes  occurring  on  the  cadmium  at  -0.63, 

-0.65  and  -0.67  V vs.  SCE.  Two  faradic  processes  occurred  at  -0.63  and  -0.65V 
vs.  SCE  are  believed  to  be  the  cathodic  reduction  of  two  different  forms  of 
cadmium  hydroxides.  The  active  and  the  inactive  form  of  hydroxide  standard 
reduction  potentials  are  different  by  17  mV  (7).  This  process  happens  only  on 
the  return  scan  indicating  that  the  reactant  of  this  process  is  cadmium  hydroxide 
deposited  onto  the  electrode  in  the  cathodic  background  of  hydroxide  ion 
generation. 


The  direct  reduction  of  the  cadmium  ion  is  responsible  for  the  current 
which  occurred  at  -0.69V  vs.  SCE.  This  process  does  not  appear  on  the 
nickel  electrode  experiments  shown  earlier  in  Figure  1.  Ths  reason  behind 
this  phenomenon  is  that  this  process  can  only  happen  on  the  cadmium  surface. 

In  the  earlier  experiments,  on  a nickel  electrode,  the  cadmium  surface  that 
was  created  by  the  cadmium  hydroxide  reduction  was  stripped  away  in  the  anodic 
process.  Therefore,  the  cadmium  ion  reduction  process  at  0.69V  should  occur 
if  the  cadmium  generated  in  the  nickel  electrode  was  scanned  between 
-0.6  to  -0.8V  vs. SCE  except  the  first  scan  which  starts  at  -0.4V  vs  SCE. 

The  result  shown  in  Figure  (3)  supports  the  above  theory  by  the  appearance  of 
the  reduction  wave  at  -0.69V  vs  SCE  when  the  anodic  stripping  of  the  cadmium 
ion  was  stopped.  It  is  therefore  concluded  that  the  faradic  process  occurs 
at  -0.69V  vs  SCE  is  due  to  the  reduction  of  cadmium  ion  to  cadmium  metal 
and  this  process  can  occur  only  on  a cadmium  surface. 

The  faradic  process  that  characterizes  the  initial  scan  in  the  cathodic 
direction  which  is  responsible  for  the  subsequent  passivation  of  the  nickel 
electrode  is  proposed  to  be  the  following  process: 

Cd++  + H30+  + 3e-  — CdO  + 3/2  H2  (6) 

This  process  occurs  only  on  the  nickel  electrodes  and  is  believed  to  be 
catalyzed  by  nickel  which  is  a good  hydrogen  scavanger.  The  cadmium  oxide 
film  formed  cannot  be  removed  electrochemically. 

The  nickel  electrode  was  subjected  to  different  sweep  rates  in  electrolytes 
with  pH  value  around  3.5  to  study  the  controlling  mechanism  of  the  cadmium 
oxide  formation.  The  results  are  shown  in  the  following  table: 

Sweep  Rate  Peak  Current  ip/v^ 

v (mV/sec)  ip  (jtA  ) 


n 

164 

51.86 

20 

210 

46.96 

50 

320 

45.25 

100 

446 

44.60 

200 

700 

49.50 

It  is  thereby  concluded  that  the  cadmium  oxide  formation  reaction  on  the 
nickel  electrode  is  controlled  by  the  diffusion  rate  of  either  cadmium  or 
hydronium  ions. 

depends  on  the  pH  of  the 
ues  are  -0.57V,  -0.61V  and 
rolyte  are  1.5,  3.3,  and  6.0. 
The  cadmium  hydroxide  reduction  current  does  not  depend  on  the  pH  of  the 
electrolyte. 

Figure  (5)  shows  the  pH  dependence  of  the  electrochemical  reactions 
at  the  cachiium  electrode.  In  general,  reaction  currents  of  all  faradic 
processess  increases  as  pH  of  the  solution  decreases. 


The  above  cadmium  oxide  formation  process 
electrolyte  as  shown  in  Figure  (4).  E ,/?p  val 
-0.64  V vs  SCE  when  the  pH  values  of  trie^elect 
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CONCLUSION 


The  results  of  this  study  reveals  the  complicated  sequence  of  reactions 
that  takes  place  in  the  electrochemical  impregnation  process  for  the 
fabrication  of  cadmium  electrode.  A cathodic  reduction  process  was  identified 
in  the  initial  cathodic  scan  of  nickel  electrodes  in  a cadmium  nitrate  solu- 
tion. This  process  is  argued  to  be  the  nickel  catalyzed  cadmium  oxide  for- 
mation, which  is  responsible  for  the  passivation  of  the  electrode.  This 
process  is  pH  dependent  and  becomes  more  cathodic  in  a more  basic  solution. 
There  may  be  two  reducible  forms  of  cadmium  hydroxide  deposition  at  potential 
sufficiently  negative  for  hydroxide  formation  as  indicated  by  the  two  faradic 
processes  that  appear  upon  scan  reversal.  Direct  reduction  of  cadmium  ion 
occurs  at  a more  cathodic  potential  than  cadmium  hydroxide  and  this  reaction 
can  happen  only  on  a cadmium  surface. 

The  above  results  has  its  implications  in  both  the  electrochemical 
impregnation  process  for  making  cadmium  electrodes  and  the  performance  of 
a cadmium  electrodes  in  an  alkaline  buffer.  Lowering  the  pH  reduces  the 
reduction  potential  of  the  cadmium  oxide  formation  on  the  nickel  surface. 

This  may  be  the  reason  that  electrochemical ly  impregnated  cadmium  electrodes 
formed  under  acidic  condition  and  fails  to  retain  its  capacity  upon  cycling. 
The  complicated  sequence  of  electrochemical  chemical  reactions  indicates 
the  need  of  proper  potentiostatic  control  of  the  impregnation  process. 

The  nickel  substrate  probably  plays  an  important  role  in  the 
passivation  of  the  cadmium  electrode.  Nickel  is  a good  electrocatalyst  for 
hydrogen  evolution  and  may  thereby  promote  the  formation  of  the  irreducible 
cadmium  oxide  film  that  passivates  the  electrode.  The  nickel  surface  when 
exposed  to  a cadmium  negative  electrode  may  cause  the  permanent  loss  of 
capacity  by  the  formation  of  this  passivating  film  during  the  changing  cycle 
of  the  battery. 

Since  it  is  believed  that  hydrogen  dissolves  extensively  in  metallic 
nickel,  this  nickel  electrode  may  actually  catalyze  reaction  (6)  by  removing 
hydrogen  gas  as  it  is  formed  at  the  electrode  surface.  This  conjecture  is 
supported  by  the  evidence  (Figure  3)  that  no  passivating  film  is  formed  on 
a nickel  electrode  that  has  a fresh  cadmium  surface  deposited  upon  it  by  the 
reduction  of  cadmium  hydroxide.  It  appears  that  the  formation  of  this  passive 
film  is  delecterious  to  the  Air  Force  electrochemical  impregnation  process  for 
making  cadmium  electrodes.  Work  should  be  initiated  to  make  nickel  plaque 
material  behave  like  cadmium  in  the  deposition  process. 
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Figure  2:  Multiple  acan  voltammetry  of  cadmium  electrode ‘into 
the  catodic  background  in:  (a)  2M  Cd{N0_)2  IMKNOl 
solution,  and  (b)  IM  KNO  solution.  * * 
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Figure  4: Multiple  scan  voltammetry  of  nickel  electrode  in  2M  CdCNO^ ) 


KNO^  solution 
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Figure  5'  Multiple  scan  voltammetry  of  cadmium  electrode  in  21-1  CdCNO-j  )2 
1M  KNO3  solution. 
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drical Tube  by  Using  Laser  Doppler  Velocimeter  (LDV) 

Summary : The  profiles  of  turbulent  velocity  and  intensity  for  a 
sudden  expansion  cylindrical  tube  are  measured  at  both 
center  line  and  several  cross  sectional  areas  by  using 
the  LDV  technique.  The  sudden  expansion  cylindrical 
tube  is  chosen  in  this  experiment  to  simulate  the  ramjet 
combustion  system.  The  scatter  of  velocity  profiles  is 
about  1%  ~ 2%  at  the  low  turbulent  intensity  region  and 
can  be  as  high  as  20%  at  the  high  turbulent  intensity 
region.  Figure  3.  The  turbulent  intensity  is  still 
difficult  to  measure  when  it  is  high,  because  the  per- 
turbation of  velocity  is  very  random  and  fast  at  this 
region.  Figure  4.  In  order  to  obtain  good  data  for 
turbulent  measurements , a computer  is  suggested  to  con- 
nect to  a current  existing  LDV  signal  processing  unit 
thus  a large  number  of  data  can  be  analyzed  by  the 
computer . 
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I . Introduction 

The  technique  of  laser  Doppler  velocimeter  (LDV)  has  been 
used  quite  often  in  fluid  velocity  measurement.  Yeh  and  Cummins 

(1)  introduced  the  technique  of  LDV  in  1964  for  the  steady  state 
laminar  flow  in  a circular  tube.  In  1967,  Goldstein  and  Kreid 

(2)  demonstrated  the  LDV  technique  to  measure  laminar  flow  in 
a square  duct.  Since  then,  many  investigators  (3  - 10)  have 
used  the  LDV  device  to  measure  the  flow  characteristics  in 
different  areas,  such  as,  polymer  solution,  laminar  flow  and 
turbulent  flow;  and  to  study  the  LDV  performance  from  the 
theoretical  stand  point. 

In  general,  a LDV  system  can  be  considered  as  made  up  of 
two  parts,  the  optical  system  and  the  signal  processor.  The 
optical  system  consists  of  the  following  components.  Figure  2: 

(1)  a laser  which  is  used  to  provide  a main  focused  light  source, 

(2)  a beam  splitter  which  is  used  to  separate  the  initial  beam 
into  two  equally  intensified  beams,  (3)  focusing  lens  are 
forcing  two  beams  to  cross  at  point,  and  forming  fringe  pattern 
at  the  crossing  point,  (4)  collecting  lens  are  used  to  collect 
light  into  a photodetector,  and  (5)  photodetector  is  used  to 
convert  the  light  energy  into  the  electrical  energy.  The  fringe 
pattern,  or  measuring  volume,  formed  by  intersection  of  two 
beams  is  the  key  zone  which  makes  LDV  function.  When  a 
particle  moving  with  a fluid  impinges  the  measuring  volume,  the 
light  is  scattered  in  all  directions  and  some  of  it  is  picked 


-2- 


up  by  a photodetector.  The  movement  of  particle  shifts  the 
frequency  of  the  scattered  light  by  an  amount  known  as  the 
"Dopple  shift"  which  is  proportional  to  a component  of  the 
particle  velocity.  By  knowing  the  fringe  space  the  particle 
velocity  can  then  be  calculated.  The  fringe  space  is  the 
function  of  intersection  angle,  8,  and  wave  length,  X,  of  the 
incident  beams,  and  can  be  expressed  as  the  following  equation, 

a = X 

2 sin  0 . 

I 

In  order  to  obtain  good  signal  from  a LDV  system,  the  optical 
system  must  first  of  all  be  properly  aligned. 

Signal  processor  is  a part  of  LDV  device  which  converts 
the  Doppler  frequency  into  a readable  voltage  signal,  from 
which  the  frequency  can  be  calculated.  An  oscilloscope  can 
also  be  used  as  an  auxilliary  device  connecting  to  the  signal 
processor  to  observe  frequency  signal,  and  the  frequency  signal 
can  be  compared  with  the  digital  value  obtained  from  the  signal 
processor. 

In  operating  the  signal  processor  the  approximate  par- 
ticle velocity  has  to  be  known  before-hand,  so  that  the  high 
pass  and  low  pass  filters  can  be  set  at  the  neighborhoods  of 
the  particle  velocity  and  filt  out  "noise"  frequency.  The  noise 
frequency  is  possibly  contributed  by  room  lights,  photodetector, 
and  the  variation  of  particle  velocity  as  it  goes  through  the 
measuring  volume. 

Signal  processor  can  be  categorized  into  two  types:  Counter 
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and  Tracker.  Counter  has  the  high/low  pass  filter  accessory 
and  it  is  required  to  know  the  measurement  velocity  in  advance 
in  order  to  set  the  high/low  pass  filter  at  the  nearby  measure- 
ment frequency.  For  using  Tracker  one  has  to,  first  of  all, 
find  or  "track"  a signal,  and  as  the  signal  is  "locked"  proper- 
ly the  Tracker  will  adjust  automatically  in  15%  of  frequency 
changes.  In  this  study  since  the  velocity  of  the  center  line 
and  the  cross  section  is  changing,  it  is  difficult  to  pre- 
estimate the  velocity  at  each  new  location.  Therefore,  Tracker 
is  used  as  the  primary  signal  processor  to  obtain  the  frequency, 
and  Counter  is  used  as  the  secondary  unit  to  compare  the  results. 

II . Experimental  Apparatus 
(a)  Flow  System 

A 22-inch  long  and  4-inch  ID  plastic  circular  tube 
is  used  as  a testing  section  for  this  study.  Figure  1,  the 
thickness  of  the  tube  is  about  V . Annulus  plates  with 
1-inch  thick  and  2-inch  inside  circular  hole  are  co-axially 
attached  to  the  22-inch  long  testing  tube  which  generates 
a sudden  expansion  flow  pattern  inside  the  tube.  The 
apparatus  is  to  simulate  the  ramjet  combustion  system. 

The  back  end  of  the  tube  is  connected  to  vacuum  cleaners 
and  the  flow  is  sucked  in  from  the  front  end  of  the  tube. 

A capillary  tube  is  located  at  the  inlet  of  the  testing 
section,  and  the  capillary  is  connected  to  a manometer  to 
read  the  inlet  air  velocity.  Two  pieces  of  co-axial  tubes 
are  connected  to  the  both  ends  of  the  testing  section  tube. 
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The  front  end  is  used  to  regulate  the  aerosol  particles 
flow  and  the  back  end  is  used  to  eliminate  the  end  effect 
of  exit  flow. 

The  entire  tube  is  fixed,  and  the  laser  optical 
system  can  be  moved  axially  and  radially  so  that  the  in- 
tersection of  two  beams  - fringe  pattern  or  measuring 
volume  - can  be  adjusted  at  any  axial  and  radial  locations. 
Baking  soda  is  used  as  solid  particles  and  the  particle 
diameter  is  about  0.5-u  in  diameter. 

(b)  Optical  System 

The  major  characteristic  of  optical  system  is  dis- 
cussed in  the  previous  section  and  is  also  depicted  in 
Figure  2.  An  item  which  has  not  been  discussed  is 
Frequency  Shifter.  The  frequency  shifter  is  an  acousto- 
optic cell  (Bragg)  which  is  used  to  shift  the  laser  light 
frequency.  By  properly  shifting  the  laser  light  frequency 
one  will  be  able  to  set  the  high/low  pass  filter  of 
Counter  Signal  Processor  at  constant  levels  during  a 
sequency  of  measurements,  and  also  can  measure  the  revers- 
ible flow  velocity.  For  example  if  one  is  going  to  measure 
the  Doppler  shift  frequency  changing  somewhere  from  10MHz 
to  2 MHz,  and  by  using  frequency  shifter  of  5MHz  with  the 
flow  velocity,  then  the  Doppler  shift  frequency  will  be 
about  15  MHz  to  7 MHz.  Therefore,  one  may  decide  to  set 
the  high/low  pass  filter  at  32  MHz  and  8 MHz  during  the 
sequency  of  measurements. 
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III . Results 

The  profiles  of  velocity  and  turbulent  intensity  along 
the  center  line  of  the  testing  tube  are  depicted  in  nondimen- 
tional  quantities  on  Figures  3 and  4.  The  figures  show  that 
the  velocity  decreases  to  about  25%  of  the  inlet  velocity  when 
X/D  approaches  to  about  10  where  the  flow  is  fully  developed. 
The  turbulent  intensity  increases  from  zero  at  the  inlet  point 
to  the  maximum  at  X/D  equals  to  about  6~8,  and  decreases  down 
to  the  steady  state  region.  At  the  high  turbulent  intensity 
region  the  scatter  of  data  is  large,  especially  for  the  in- 
tensity data,  because  at  this  region  a lot  of  particles  move 
fast  and  exceed  the  frequency  which  eventually  the  signal 
processor  would  lose  to  "track". 

The  velocity  profile  and  turbulent  intensity  for  the 
cross  sectional  areas  are  shown  on  Figures  5 to  8.  At  cross 
section  of  X/D  = 1.25,  it  shows  a negative  velocity  profile 
exists  near  the  wall  which  indicates  the  separation  of  the 
boundary  layer.  Moon  and  Rudinger  (10)  show  in  their  results 
that  the  boundary  separation  exists  at  about  X/D  = 2.0  which 
agrees  with  this  study.  The  turbulent  intensity  for  the  cross 
sectional  profile  is  extremely  difficult  to  measure  near  the 
wall  region,  the  complicated  situation  is  probably  contributed 
by  the  rebounding  of  particles  from  the  wall  which  cause  trans- 
versal motion  of  particles. 

IV.  Conclusion  & Recommendation 

The  turbulent  velocity  profile  of  the  center  line  and  the 
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cross  section  for  a sudden  expansion  of  circular  tube  can  be 
measured  by  using  LDV  technology  with  the  regular  signal 
processor  - such  as.  Counter  or  Tracker.  However,  the  tur- 
bulent intensity  is  still  difficult  to  measure  at  the  regions 
near  the  wall  and  where  the  turbulent  intensity  is  high,  it  is 
because  the  velocity  perturbation  changes  so  randomly  and 
rapidly  at  these  regions . In  order  to  overcome  these  problems , 

it  is  suggested  to  use  a computer  to  collect  and  analyzed  a 

\ 

large  amount  of  data  simultaneously. 
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a : fringe  space 


D : inlet  diameter  of  testing  tube,  2 inches 
r : radial  distance  of  testing  tube 
R : radius  of  testing  tube,  2 inches 
u : local  velocity 
Ua  : center  line  velocity 

Ttl 

TJ 

IN  : air  inlet  velocity 
X : axial  distance  from  inlet  point 
9 : intersection  angle  of  laser  beams 

A : wave  length  of  laser  beam 
u’  : local  perturbation  velocity 
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PRELIMINARY  DESIGN  PROCEDURE  FOR  HIGH  POWER  DENSITY  MHD  GENERATORS 

PAU-CHANG  Ltf* 

Abstract 

The  steps  to  be  taken  in  the  preliminary  design  of  a high  power 
density  MHD  generator  are  formalized.  The  recommended  design  procedure 
starts  with  the  optimum  choice  of  the  combustion  chamber  pressure.  The 
optimization  is  based  on  a semi -empirical  expression  of  the  effective 
power  developed  recently  by  Smith  and  Nichols.  As  a result  of  this  opti- 
mization, a rough  estimate  of  the  realizable  power  density  is  made,  which 
yields  the  order  of  magnitude  of  the  transverse  dimension  (keeping  the 
length-to-diameter  ratio  around  10)  for  the  desired  power  output.  It  is 
then  recommended  that  the  area  variation  (and  length)  be  calculated  on 
the  basis  of  an  isothermal  core  flow.  This  preliminary  shape  of  the  duct 
will  serve  as  the  base  on  which  variations  can  be  made  by  a computer  to 
accomodate  wall  effects,  in  the  detailed  design  stage.  An  alternative 
route  for  the  preliminary  design  is  also  provided  by  using  scaling  laws. 
Starting  with  a well-designed  generator  which  is  demonstrably  high  in 
power  density,  dynamically  similar  units  can  be  produced  using  these 
laws.  The  laws  are  developed  following  the  modern  procedure  of  order- 
ing. Numerical  examples  are  provided  to  illustrate  the  procedure. 

* Professor  of  Mechanical  Engineering,  University  of  Nebraska-Lincoln, 
USAF-ASEE  Summer  Faculty  Research  Fellow,  Wright-Patterson  Air  Foree 
Base,  Ohio. 
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This  is  the  final  report  on  Preliminary  Design  of  High  Power 
Density  MHD  Generators  under  the  USAF-ASEE  Summer  Faculty  Program 
(WPAJB),  administered  at  the  Ohio  State  University,  from  June  5 to  16, 
and  from  June  26  to  August  Id  1978* 

Dr.  J.  F.  Holt  of  the  Aero  Propulsion  Laboratory  suggested 
the  present  study.  The  author  appreciates  greatly  the  opportunity  of 
engaging  Dr.  Holt  in  enlightening  discussions  on  numerous  occasions. 
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SECTION  I 
INTRODUCTION 

Designing  MHD  generators  for  high  power  density  is  currently 
United  to  a snail  circle  of  practitioners,  away  from  the  main  stream 
of  MHD  activities;  the  planning  is  usually  done  in  an  ad  hoc  manner, 
for  individual  cases,  without  stating  clearly  the  approach  followed  or 
the  philosophy  adopted.  To  render  such  planning  more  a science  than  an 
art,  it  is  ths  purpose  of  the  present  study  to  formalize  the  steps  to 
be  taken  in  the  preliminary  design  of  a high  power  density  MHD  generator. 
These  recommended  steps  are  gathered  here  (presumably  for  the  first  time) 
for  one  object  (and  one  object  only):  the  realization  of  maximum  possible 
power  generated  per  unit  volume.  Detailed  design  calculations  (on  a com- 
puter) that  follow  these  preliminary  steps  will  undoubtedly  Indicate 
trade-off  points  for  a variety  of  meritous  features.  But  in  this  re- 
port, only  maximum  power  density  is  being  pursued. 

In  gathering  material  from  a widely  scattered  literature  for  the 
expressed  purpose,  the  author  can  hardly  claim  any  origlnaUty.  Although 
critical  comments,  personal  judgments,  minor  discoveries,  slight  exten- 
sions and  small  variations  abound,  this  report  remains  but  a designer's 
guidebook. 

In  Section  II,  the  recommended  design  procedure  starts  with  the 
optimum  choice  of  the  combustion  chamber  pressure,  after  a brief  des- 
cription of  optimization  calculations  to  be  done  on  the  inlet  Mach  number, 
seeding  ratio,  and  0/F  (oxygen  to  fuel  ratio).  The  optimization  is  based 
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• m 


on  a send -empirical  expression  of  the  effective  power  output  developed 
only  recently  by  Smith  and  Nichols  (ref.  1).  This  new  approach  apparently 


provides  the  most  rational  basis  to  the  design  procedure  to  date. 

As  a result  of  Section  II,  a rough  estimate  of  the  realizable 
power  density  emerges,  which  will  yield  the  order  of  magnitude  of  the 
transverse  dimension  (keeping  the  length/di ameter  around  10)  for  the 
desired  power  output. 

In  Section  III,  it  is  recommended  that  the  area -variation  (and 
the  length)  of  the  generator  be  estimated  on  the  basis  of  an  isothermal 
core  flow.  The  rationale  here  is  as  follows:  The  plasma  conductivity 
varies  exponentially  with  temperature;  therefore,  keeping  the  entire 
duct  uniformly  at  a high  temperature  level  would  promote  high  power 
density.  (In  contrast,  a constant  velocity  design  would  incur  heavy 
temperature  drop  and  pressure  loss;  the  latter  would  also  burden  the 
diffuser  heavily.)  This  preliminary  shape  of  the  duct  will  eventually 
serve  as  the  base  on  which  variations  will  be  made  on  a computer  to 
accomodate  the  wall  effects.  The  final  (computer-aided)  design,  of 
course,  will  not  come  out  isothermal;  but  its  tenperature  variation 
in  the  flow  direction  will  certainly  be  relatively  small. 

Section  III  contains  formulas  developed  for  the  isothermal 
core  flow  through  a diagonal  conducting-wall  generator.  To  the 
author's  present  knowlegde,  these  extended  formulas  for  the  case 
of  diagonal  conducting  walls  are  new. 

In  Section  IV,  the  preliminary  design  is  carried  out  along  a 
different  route.  Starting  with  any  well  designed  generator  which  is 
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demonstrably  high  In  power  density,  either  already  In  operation  or 


in  an  advanced  stage  of  planning,  scaling  laws  can  be  applied  to 


produce  dynamically  similar  units  for  a different  power  output, 


and/or  magnet  stregth,  and/or  fuel,  etc.  The  procedure  can  also 


be  used  to  yield  a dynamically  similar  pilot  unit  which  can  be 


tested  before  embarking  on  the  the  larger-sized  endeavor.  It  must 


be  emphasized  here  that,  if  a unit  has  a power  density  which  is 


maximum  under  the  given  restraints,  its  dynamically  similar  models 


will  deliver  far  smaller  power  per  unit  volume  (being  still  "high", 
possibly)  . 


The  key  modeling  parameter  involved  In  the  scaling  laws  are 


established  in  Section  I V following  a modern  procedure  known  as  the 


ordering  process  (see,  e.g..  Chapter  5 of  ref.  2).  Modeling  (or  seel- 
ing) is  then  carried  out  in  the  classical  manner  (see,  e.g..  Chapter 
U of  ref.  3).  Although  the  resulting  laws  are  identical  with  those 
quoted  in  the  literature  (e.g..  Chapter  7 of  ref.  U),  the  derivation 


presented  in  this  report  is  emnantly  more  convincing. 


Finally,  in  the  short  Section  7,  minor  losses  near  the  walls 


are  discussed.  The  discussion  is  brief  since  these  losses  will  be 


accounted  for  , anyway,  in  the  next  step  of  the  design— the  computer- 


aided  simulation  and  selection. 


It  is  hoped  that  generator  designers  will  find  the  recommended 


procedure  helpful  in  providing  Initial  inputs  to  the  design  of  magnets 


(which  usually  has  to  be  started  simultaneously  with  that  of  generators). 


as  well  as  to  the  sophiscated  numerical  programs  (the  final  design  tools) 


I 
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SECTION  n 

SEIZCTION  or  PRESSURE  UBVEL 

A modern  and  operational  definition  of  design  refers  to  it  as 
"optimisation  tinder  partially  uncertain  constraints • " With  this  defini- 
tion in  mind,  one  may  state  a general  design  philosophy  or  approach  in 
the  form  of  fiTe  steps:  (1)  Assuming  that  the  operation  is  not  too  sen- 
sitive to  changes  of  various  parameters  In  a rather  large  neighborhood 
of  the  optimal  condition,  optimise  the  object  quantity  with  respect  to 
the  parameters  one  after  the  other.  (2)  Establish  a rational  guideline 
for  the  optimisation  with  respect  to  each  parameter.  (3)  Display  a 
number  of  optimal  calculations  over  a range  of  uncertain  values  of 
the  constraining  parameters.  Select  a few  (or  one),  exercising  the 
designer's  judgement.  (U)  Trade  off  (i.e.,  deviate  from  the  optimal) 
for  other  desired  or  required  characteristics.  And,  (5)  simulate  the 
few  cases  which  exhibit  overall  possibilities  on  a computer,  and  make 
a final  decision. 

With  reference  to  the  present  task  of  designing  a MHD  generator 
for  high  power  density,  we  realise  immediately  that  the  object  quan- 
tity to  be  maximised  in  the  above  steps  is  the  power  output  per  unit 
volume  w.  The  parameters  to  be  considered  at  the  outset  are  the  fuel 
used,  the  oxydiser,  the  seeding  material,  the  seeding  ratio,  the  0/F 
ratio,  the  inlet  Mach  number  M^,  and  the  combustion  chamber  pressure 
p°.  (All  products  of  combustion  refer  to  hydrocarbon  fuels . ) 


As  a measure  of  the  effective  power  density  when  loaded  for 


maximum  power,  we  will  adopt  the  following  send. -empirical  formulas 
due  to  Smith  and  Nichols  (ref.  l): 

Faraday  (Segmented)— 


(W<rVJB«,  1 

Diagonal  Conducting  Walls— 


W r* 

eff 


1>-2 (Wo-VV2,  (3*>i 


(f(l  +0t*2)(l  ♦ (3*2) 


— &)orV2B*2,  p*<i 


(1  ♦ ot*2)(l  +(*>*2) 


where  the  asterisk  is  used  to  indicate  evaluation  of  quantities  at  cer- 
tain reference  point  (the  inlet,  for  example),  and  where 
{3  • Hall  parameter 
<J*  • conductivity 
u • flow  velocity 
B - magnetic  field  strength 


OL-  (Hall  field)/( Faraday  field) 


In  an  empirical  (and  approximate)  manner,  the  formulas  account  for  the 
internal  current  leakage  and  electrode  voltage  drops.  (This  empirical 
aspect  probably  also  prevents  the  first  formula  from  being  a special 
case  of  the  second.) 

For  a given  fuel  mixture,  temperature  level,  and  B , the 


▼s.  (S*  curve  (remembering  that  <T  fj?  because  of  the  pressure  vari- 
ation) shows  a trend  as  sketched  in  Fig.  II-l  (the  curve  marked  dia- 
gonal conducting  walls  being  roughly  for  oL m -fi,  a power-maximising 

value).  It  is  thus  observed  that  veff  is  maximum  when  • 1,  and  that 

* 

weff  ^°creaae8  much  faster  for  decreasing  p below  1 than  Increasing 
* . 

£ above  1. 

A rational  guideline  for  the  selection  of  the  pressure  level, 

* * * 
represented  by  p , now  emerges:  Choose  p such  that  (Z  , for  the  given 

T and  B , equals  1;  and,  if  practical  constraints  force  a deviation, 
make  (2>*  > 1 (rather  than  ■<  1). 

In  following  this  guideline,  there  is  still  the  question  of 
where  to  enforce  it.  Should  the  asterisk  refer  to  the  nossle  exit, 
where  the  magnetic  field  peaks,  or  some  kind  of  average  state?  An- 
ticipating an  isothermal  (or,  approximately  isothermal)  design,  (T 
will  stay  the  same  from  the  inlet  to  the  peak  point  of  the  field. 

Also,  from  existing  designs  where  temperature  drop  is  relatively 
slight,  we  observe  that  the  flow  velocity  hardly  changes  before 
the  peak  of  the  magnetic  field.  Therefore,  it  seems  desirable  to  se- 
lect the  pressure  value  so  as  to  make  ^ » 1 (or  slightly  above)  at 
the  peak  of  the  field.  Thus,  with  a computer  print-out  of  j3(T,  p,  B) 
in  hand  (actually,  ^°cB),  one  will  select  this  pressure  level  ac- 
cordingly. 

Once  the  pressure  at  the  peak  point  is  chosen,  one  can  add  an 
empirical  percentage  (~10£)  to  it  to  obtain  p^  at  the  inlet.  (In 
practice,  this  amounts  to  a minor  option  only,  since  a preliminary 


design  Is  routinely  within  a band  of  variation  wider  than  10$. ) After 
this,  the  conbustion  pressure  can  be  estimated  for  every  given  M^ 
(knowing  the  equivalent  ratio  "1  of  the  specific  heat  capacities  of 
the  plasma)^  Incidentally,  also  links  the  generator  temperature  with 
the  combustion  temperature  T°. 

As  an  example,  toluene  ♦ Og  ♦ CsgCO^  at  the  stoichiometric  0/P 
ratio  is  treated  in  the  manner  Just  described  for  a peak  magnetic 
field  of  It  T.  The  result  is  summarised  in  Table  -II -1.  Two  effective 
combustion  temperatures  are  employed  in  the  table  since  the  tempera- 
ture level  at  the  nossle  entrance  may  be  raised  somewhat  by  an  in- 
crease in  combustion  pressure,  and/or  an  improvement  in  the  combus- 
tion chamber  design,  etc. 

It  must  be  emphasised  once  again  that  the  optimal  pressure 
level  is  chosen  independently  of  optimisation  with  respect  to  the 
other  operational  parameters.  If  all  parameters  are  optimal,  the 
effective  power  density  will  be  the  maximum  of  maxima;  otherwise, 
it  will  only  be  the  maximum  for  a given  (non-optimal)  set  of  para- 
meter values.  As  a matter  of  fact,  in  Table  II-l,  although  the  Mach 
number  is  in  the  optimal  range,  the  seeding  and  the  0/F  ratios  are 
both  rather  far  from  being  optimal. 

With  the  aid  of  a computer,  the  effective  power  density  can  be 
optimised  with  respect  to  each  one  of  the  four  parameters,  seeding 
ratio,  0/P  ratio,  inlet  Mach  number,  and  the  pressure  level.  Essen- 
tially, the  computer  is  to  print  out  a chart  showing  the  variation 
of  we^f  with  respect  to  these  parameters;  the  optimal  values  are 


TABUS  II-l 


OPTIMAL  COMBUSTION -CHAMBER  PRESSURE  FOR  STOI 
CHI  (METRIC  COMBUSTION  OP  TOLUENE  AND  OXYGEN 
SEEDED  WITH  CESIUM  CARBONATE 
(L  TESLA) 


! 


then  easily  identified.  Actually,  less  elaborate  study  (ref.  5)  has 


shown  that  M^  should  always  be  around  2 for  products  of  combustion 
to  realise  maximum  power  density  (which  fact  is  also  borne  out  by 
a detailed  numerical  example  in  ref.  1).  Thus,  for  a preliminary  de- 
sign, anticipating  certain  practical  ranges  of  the  seeding  and  0/F 
ratios  (which  should  be,  but  might  not  be,  around  the  optimal  value) 
and  M^  (which  must  be  around  2),  the  selection  of  pressure  level  may 
as  well  be  effected  by  following  the  suggested  guideline  as  exempli- 
fied in  Table  II-l. 

Finally,  each  selection  in  Table  II-l  has  an  anticipated  power 
density  associated  with  it;  we  will  quote  four  numbers  here  as  illus- 
trations : 

e 

10%  seeding,  - 2.1— 

T°  - 3100  K:  w —60  MW/m3 
T°  - 3U00  Kj  w _ 320  MW/m3 
30%  seeding,  M^  » 2.1— 

T°  • 3100  Kj  w~  125  MW/*3 
T°  - 31*00  K:  w ~ 620  MW/m3 

From  these  figures,  the  generator  volume  can  be  estimated  for  a desired 
power  output.  In  addition,  if  an  empirical  length-to-diameter  ratio 
(~10)  is  adopted,  based  on  a compromise  between  end  and  wall  effects, 
the  linear  sise  of  the  generator  can  also  be  estimated. 


: 


SECTION  III 


L 


ISOTHERMAL  DESIGN 


After  the  optimization  discussed  In  the  previous  section , the 
design  philosophy  calls  for  the  determination  of  the  duct  shape  (i.e., 
the  area  variation  in  the  flow  direction),  as  far  as  the  core  is  con- 
cerned, for  isothermal  generation  of  electricity.  (The  result  will 
serve  as  the  base  shape  upon  which  the  wall  effects  will  be  added  in 
the  computer  simulation  that  follows  the  preliminary  design.)  To  this 
end,  let  us  first  collect  all  the  governing  equations  (referring  to 
Fig.  in-1)  which  describe  a general  core  flow: 


a uA  • constant  (■  /\U^A^ ) 

(1) 

puu'  ■ -p'  ♦ jyB 

(2) 

/>u{cpT«  ♦ (u2/2)'}  - * JyE7 

(3) 

p - p/RT 

(U) 

\ -otEy 

(5) 

K - Ey/uB 

(6) 

ix  - {<T/(1  ♦ ^2)}(uB){ea[  - <3(K  - 1)} 

(7) 

^ Jy  - {<*7(1  ♦ (32)}(uB){*£K  ♦ (K  - 1)} 

(8) 

where  prims  denotes  differentiation  with  respect  to  x;  subscripts  x and 
y indicate  specific  components;  f>.  A,  j,  E,  R,  cp,  and  K are  respectively 
the  plasma  density,  cross-sectional  area  of  flow  passage,  current  den- 
sity, electric  field,  gas  constant  of  the  plasma,  specific  heat  capa- 
city at  constant  pressure  of  plasma,  and  the  loading  factor;  and,  all 
quantities  are  local.  In  Equation  (1),  p^  and  u^  are  known  from  the 
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procedure  presented  in  the  previous  section;  can  also  be  decided 
roughly  on  the  power  desired  as  explained  at  the  end  of  the  previous 
section. 

Zh  the  above.  Equations  (2)  through  (6)  can  be  easily  combined 
into  the  following  key  equation: 

(p/iir)cpT'  - p*  + {<3/(1  ♦ <S2)](uB2){<A2  ♦ (K  - l)2}  (9) 

This  equation  is  completely  general. 

Before  we  narrow  down  our  scope  to  the  isothermal  case,  let  us 
point  out,  in  passing,  one  interesting  fact  about  the  general  Equation 
(9):  For  the  case  where  c lm  -(i and  K - 0.5,  it  reduces  to  the  form 

(p/BT)CpT'  » p'  + (y(uB2/U)  (10) 

which,  being  exactly  the  case  for  ($  » 0 (and  K • 0.5),  has  been  ex- 
tensively studied  (for  all  values  of  K)  in  the  literature  (see  ref.  6 
for  a unifying  approach).  The  case  with  ol  • -jj  and  K «0.5  is  practic- 
ally of  great  interest  since  o(  - maximizes  the  power  density  (with 
respect  to  at),  while  K *0.5  is  not  far  from  being  optimal  tinless 
deviates  very  much  from  1.  For  such  a case,  all  the  available  solutions 
(in  the  absence  of  the  Hall  effect)  can  be  directly  employed  by  substi- 
tuting 0.5  for  K. 

Going  back  now  to  the  isothermal  case  exclusively,  we  have  from 
Equation  (9) 

p'  - - {0/(1  ♦ p2)} (uB2){o*?K2  ♦ (K  - l)2}  (11) 

But,  from  Equation  (2)  and  (8), 

{0/(1  ♦(J2)}(uB2)  -{puu»  + p'}/{ol(iK  ♦ (K  - 1)}  (12) 

Thus,  combining  Equations  (11)  and  (12),  we  have 
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... 


p*  - -{c*2K2  ♦ (K  - l)2}{f>uu'  ♦ p'}/{olfK  ♦ (K  - l)j  (13) 
or, 

k{k(«s62  ♦ 1>  ♦ (o<|i  - l)}p'  - -^C2K2  + (K  - l)2jl(p/BT)(uu') 
Integrating,  we  obtain  finally  the  relation  ^or  constant  |3,  and  Kj; 


where 


a - 


For  a variable 


o62K2  ♦ (K  - l)2 
(2HT)k{k (ot2  ♦ 1)  ♦ (olp  - 1)] 

^ (■  b/p),  but  with  cl  ■ and  constant  K, 


<i5) 

we  hare 


(16) 

After  establishing  such  a p vs.  u relationship,  we  can  can  go 
back  to  Equation  (2)  and  integrate  it  with  respect  to  x.  The  result, 
with  the  help  of  Equation  (13),  is  as  follows: 


x 


(1  ♦ (i2)dp 
♦ (K  - l)2  | (<ruB2) 


(17) 


where  the  p vs,  u relationship  is  to  be  Incurred  In  the  Integration, 
Note  also  that  Equation  (17)  is  completely  general,  with  varying  C~, 
£ , etc. 

In  any  given  case,  p(x)  obviously  will  become  known  at  this 
step;  u(x)  will  then  emerge  from  the  p vs,  u relationship.  Next, 
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since  p<acp,  A(x)  can  be  calculated  Tla  Equation  (l).  To  determine  the 
total  length  of  the  generator,  one  simply  sums  up  the  power  output  along 
the  duct  until  the  desired  total  output  is  reached. 

(Although  it  is  not  likely  that  the  exit  pressure  would  turn  out 

; 

lower  than  0.2  atm;  but,  nonetheless,  this  value  must  be  checked.  An  exit 
pressure  lower  than  0.2  atm  will  burden  the  diffuser  heavily.) 

Based  on  such  a preliminary  core  design,  end  effects,  wall  losses, 
as  well  as  real-gas  effects  can  be  added  to  yield  the  adjusted  duct  shape. 

It  is  not  known  whether  the  above  "almost-isothermal"  procedure  is  actu- 
ally being  used  by  the  designers  in  the  field.  But  it  is  gratifying  to 
note  that  in  at  least  two  real  cases  (both  supersonic  and  for  high  power 
density)  the  temperture  deviation  is  kept  below  2.5**  it  decreases  by 

U00  kW  generator  (ref.  7)\  and  increases  by  1.3*  * j 

generator  (ref.  8). 


-16- 


about  2.3*  i n the  AVCO 
in  the  Russian  PAMIR  I 


SECTION  IV 


SCALING  LAWS 

The  word  "sealing"  as  used  here  is  restricted  to  mean  dynamic 
and  geometric  similitudes  among  a series  of  MHD  generators.  The  aim 
of  scaling  is  to  ensure  that,  once  one  member  of  the  series  is  probed 
extensively  (either  by  actual  measurements  if  it  is  already  operating, 
or  by  detailed  numerical  simulation  if  it  is  not  yet  built),  the  per- 
formance of  any  other  member  of  the  series  can  be  predicted. 

The  fundamental  tool  behind  scaling  is  dimensional  analysis 
(or,  alternatively,  ordering).  In  principle,  every  physical  phenomenon 
can  be  described  by  a list  of  dimensionless  parameters  (in  addition  to 
geometric  shape,  ratios,  and  angles).  Two  specific  instances  of  that 
phenomenon  are  mutually  transformable  in  a dynamically  similar  manner 
if  they  have  the  same  numerical  values  for  all  the  modeling  parameters, 
with  geometric  similarity  taken  for  granted.  Scaling,  or  modeling  (see 
Chapter  U of  ref.  3),  then  calls  for  keeping  the  dimensionless  para- 
meters the  same  among  individual  members  of  a series. 

In  practice,  if  the  dimensionless  parameters  governing  a certain 
phenmenon  are  many,  a dynamically  similar  series  may  very  well  end  up 
containing  only  one  member.  The  conclusion  of  the  scaling  process  then 
becomes  degenerate  and  trivial— namely,  every  specific  instance  is  only 
similar  to  itself.  This  would  be  the  end  of  scaling,  requiring  indivi- 
dual instances  be  probed  (numerically  or  experimentally)  separately. 


In  order  to  have  a meaningful  correlation  among  different  mem- 


bers  of  a series,  certain  dimensionless  parameters  may  have  to  be  al- 
lowed to  vary  from  member  to  member.  If  the  parameters  left  open  In 
this  manner  are  the  geometric  ratios  and  angles,  we  have  a distorted 
sealing;  otherwise,  we  have  a partial  scaling. 

For  MHD  generators,  a (severely)  distorted  scaling  will  require 
detailed  numerical  simulation  to  compensate  for  the  drastic  deviations 
in  the  predicted  performance.  Since  scaling  is  basically  a tool  used 
to  provide  quick  guidelines  before  detailed  analysis,  a distorted 
scaling  is  of  no  use  to  MHD  generator  design— one  might  as  well  skip 
to  numerical  simulation  directly. 

The  rest  of  this  section,  therefore,  is  devoted  exclusively  to 
the  partial  scaling  of  MHD  generators. 

Applying  the  ordering  process  (see  Chapter  5 of  ref.  2)  to 
Equations  (l)  through  (8)  (and  thereby  ignoring  end  and  wall  effects), 
we  introduce  the  following  representative  quantities: 

Density— — 

Velocity — — u^ 

Area— 

x-coordinate Generator  length  L (which  is  proportional 

to  /J. because  of  geometric  similitude) 
Magnetic  field—  Peak  magnetic  field  B* 

Pressure—  p^ 

Temperature T^ 

Conductivity—  0*^ 
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Current  density W 

Electrical  field-—  u^B* 

Substituting  into  these  equations  the  dimensionless  quantities  p » /Vj^, 
etc.,  with  the  tilde  signifying  nondimensionalization,  we  have 
r puA  » 1 

■JfM^Cpuu')  - -p’  + SjyB 

^U{f*  + [(*-  l)/2]  M^CU2^)'}  - [(r-  * j/y) 

y f-pA 

\ Ex  - ^Ey 


/ 4 + P2)}  ^ tUK  - |3(K  - 1)] 

V.  3y  - {5/(1  ♦ p2)  } (uB^oi(3K  ♦ (K  - 1)} 
where  the  prime  now  denotes  differentiation  with  respect  to  (x/L).  As 
a result  of  this  process,  a number  of  governing  (dimensionless)  para- 
meters show  up  naturally  and  unambiguous ly ; namely, 

S - 0'iuifi*2I/pi 
» Uj/zv/THf^* 

B - B(x/L)  - B(*) 

5=  - T(T,  p) 

(1- $<?,  p))|H 

oil  K»  7 

A series  of  geometrically  similar  generators  must  have  the  same 
numerical  values  or  variations  (for  B,  p , and  (T  ) in  order  to  be  dyn- 
amically similar,  as  far  as  the  core  is  concerned.  Out  of  this  list,  ~f 
stays  around  1.1  for  all  products  of  combustion;  we  can  therefore  omit 


i Li 


it  from  further  consideration. 


from  the  literature,  we  find  that 


(Joe  B/Jf/p 
and 

Q-otT®/pn 

where  m and  n are  universal  constants  (depending  on  the  temperature 
range)  for  all  plasmas  (but  where  the  proportionality  constants  differ 
for  different  plasmas).  That  is  to  say, 

(3  - 


5 - Tm/^a 

So,  <j(T,  p)  retails  the  same  form,  only  the  consideration  of  is 
needed.  Similarly,  although  (S(T,  p)  retains  the  same  form  (assuming 
fixed  B(x)),  must  be  considered. 

In  practice,  is  made  close  to  -(3;  so,  there  is  no  necessity 
to  include  its  variation.  The  loading  factor  K is  either  kept  constant 
for  a series  of  generators,  or  it  is  to  vary  only  slightly;  and,  in  ad- 
dition, its  influence  on  the  power  density  can  be  estimated  using  a 
slug- flow  model.  Thus,  finally  the  list  of  governing  parameters  is  re- 
duced to  S,  Mi}  B(x),  and  . £ (with  0^  embedded  in  S). 

These  governing  parameters  are  also  obtained,  for  example,  by 
Garrison,  Brogan,  Nolan,  et.  al.  (ref.  9),  presumably  through  standard 
dimensional  analysis.  The  parameter  S also  frequently  appears  in  the 


literature  by  way  of  dimensional  analysis;  but,  usually,  it  is  the  only 


!•  r 


one  mentioned 


Out  of  these  four  remaining  quantities,  B(x)  is  not  likely  to 
be  fixed  for  a series  of  generators;  we  will  have  to  start  our  partial 
scaling  by  Ignoring  the  requirement  of  a fixed  B(x).  The  Influence  of 
the  field  strength  is  then  invested  only  with  the  peak  B which  appears 
in  S (and  in  (3^  through  B^).  Such  a partial  scaling  has  been  carried 
out  and  applied  by  Garrison,  Brogan,  Nolan,  et.  al.  (ref.  9) . Although 
it  is  somewhat  fruitful  and  useful,  such  scaling  yields  rather  restric- 
tive modeling  laws. 

Noting  that  the  (3-level  in  a well-designed  generator  is  always 
around  1,  and  that  the  influence  of  the  (i -level  can  be  assessed  separ- 
ately (e.g.,  in  the  manner  discussed  in  Section  II),  this  report  will 
go  one  step  ftarther  and  recommend  a partial  scaling  based  on  S and 
only.  By  relaxing  the  requirement  that  ^ be  kept  constnat  from  case 
to  case  in  a series  of  generators,  more  productive  scaling  laws  are  pos- 
sible; and  the  scope  of  application  is  suddenly  widened,  for  instance, 
taking 

<T-  cT10/p°**  (18) 

where  c is  a coefficient  the  value  of  which  is  available  for  specific 
plasmas,  we  see  that  S - constant  for  different  cases  in  a series  im- 

M 

plies  that 

L p1,5B"2T-10*5/c  (19) 

where  M - constant,  i.e., 

1 

From  this  point  on,  the  subscript  i and  superscript  * will  be 
omitted  for  ease  of  writing. 
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(20) 


UoC  T0’^ 

(noting  that  the  molecular  masses  for  all  products  of  combustion  stay 
roughly  around  35)  Is  also  enforced.  Now,  the  object  in  building  a MHD 
generator  is  to  realize  an  electrical  power  output  W which  can  be  cal- 
culated by  integrating  the  solutions  of  Equations  (l)  through  (8)  with 
respect  to  x.  By  examining  the  dimensionless  forms  of  these  equations, 
we  conclude  that  W in  the  form  of  a dimensionless  parameter  must  come 
out  a function  of  (under  the  present  partial  scaling)  S and  M,  without 
the  necessity  of  actually  solving  the  equations.  To  be  more  specific, 
we  conclude  that 

W/(<Tu2B2L3)  - F(S,  M) 

which  yields,  since  S and  M are  both  kept  fixed. 


W/(gru2B2L3)  * constant 
or, 

W OC  cru^B2!3 

oc  c^VlP/p0’*  (21) 

or,  in  terms  of  the  power  density, 

v oc  A/p0,5  (22) 

Next,  combining  Equations  (19)  through  (22)  in  various  ways,  we  have  the 
following  additional  sealing  laws: 

L oc  w0-375/(c°*25T2-7S0*5)  (23) 

p ^ c0’5!6-6*0-2^  (2U) 

w oc  B1*^0,75!7*7  (25) 


where  Equation  (25)  is  approximate  in  the  sense  that  W^*^2^/!3,  instead 
of  W/L3,  is  regarded  as  being  proportional  to  w.  Furthermore,  Equation 
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(23)  squared,  nutipUed  by  Equation  (21*),  yields 

W ©C  pL2A1#1  (or,  roughly,  pL2)  (26) 

In  applying  these  scaling  laws,  one  mast  always  bear  in  mind  four 
things:  (1)  A(x)  is  fixed;  (2)  Equations  (18)  through  (20)  must  be  satis- 
fied; (3)  there  are  possible  errors  in  ignoring  B(x),  c*£  , ^3,  ans  K;  and 
(U)  deviations  must  be  anticipated  from  end  and  wall  losses.  To  guard 
against  possible  misapplications,  let  us  quote  here  one  counter-example: 

In  applying  Equation  (26)  to  the  same  generator,  one  seems  to  see  that  W 
is  proportional  to  p;  but  this  is  singularly  uninteresting,  since  Equa- 
tion (19)  then  dictates  that  p be  fixed  (for  fixed  B,  T,  and  c).  A cor- 
rect interpretation  of  Equation  (26)  would  be,  for  example , thus:  For  a 
given  generator,  equipped  with  a different  magnet,  the  pressure  level 
must  be  adjusted  according  to  Equation  (19)  in  order  to  operate  in  a 
dynamically  similar  manner  as  when  the  old  magnet  is  used;  then  (and 
only  then),  W will  change  in  direct  proportion  to  p.  (If  nothing  but  p 
is  changed,  the  new  operation  will  not  be  dynamically  similar;  snd  W 
will  not  be  proportional  to  p.)  We  would  also  like  to  take  this  opportu- 
nity to  emphasize  the  fact  that  it  is  the  dimensionless  quotient  w/o'u2B2 
that  remains  fixed  from  member  to  member  in  a series  of  dynamically  si- 
milar generators;  the  power  density  w definitely  will  change  from  case 
to  case.  It  is  especially  important  to  bear  in  mind  that,  if  a generator 
is  designed  for  maximum  power  density,  and  if  a scaled-down  unit  is  first 
built,  the  smaller  unit  is  definitely  not  going  to  be  able  to  claim  maximum 


1 

I 
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power  density 


Among  the  above-quoted  scaling  laws.  Equations  (19),  (23),  (2L), 
and  (26)  have  been  derived  before  by  Rosa  (Chapter  7 of  ref.  U),  using 
an  intuitive,  heuristic,  and  simplistic  argument  which  is  rather  uncon- 
vincing ( the  requirement  of  fixed  M being  absent). 

In  the  rest  of  this  section,  we  will  apply  some  of  the  scaling 
laws  to  scale  up  or  down  some  existing  designs  known  for  their  high 
power  densities.  We  will  use  subscripts  1 and  2,  respectively,  to  denote 
the  base  base  design  and  the  scaled  unit.  Temperature  and  pressure  of  the 
combustion  chamber  are  used  in  the  calculations;  for  fixed  Mach  number, 
they  are  proportional  to  those  at  the  generator  inlet.  The  two  specific 
formulas  used  are  Equations  (19)  and  (22)  which  are  rewritten  as 

- (c1/c2)(p2/p1)1*5(B1/B2)2(T1A2)10*5  (27) 

w2/Wl  - (c2/c1)(T2A1)11(p1/p2)°*5  (28) 

The  results  of  the  scaling  are  summarized  in  Table  IV-1.  (A  part  of  the 
calculation  was  carried  out  by  Dr.  J.  F.  Holt  of  the  Air  Force  Aero  Pro- 
pulsion Laboratory,  the  whoa  the  author  is  indebted  in  many  ways.)  A de- 
tailed description  of  the  base  designs  is  given  in  the  followings 
Base  Design  #1— AFAPL  KIVA-1  (ref.  10) 

Toluene  ♦ 02,  seeded  with  Cs 
p°  - 10  atm,  T°  - 3100  K 
M.  - 2,  B*  - 2.3  T 

V ^ 

W - 200  kW,  w - UO  MW/m3 
-2U- 


T — 


L ■ 0.7  m 

Inlet  dimension  2U.9  mm  x 99.8  ram 
Exit  dimension  72.6  ram  x 1UU. 3 mm 
Base  Design  #2 — Russian  PAMIR- 1 (ref.  8) 

Solid  fuel  (c  ■ 2.22xlO"33  (raho/m)(atm)°'^(K)“^0) 
p°  - U5  atm,  T°  - 3559  K 
M±  - 2.LU,  B*  - U T 
W - 15  MW,  w - 500  MW/m3 
L * 1 ra 

Inlet  dimension  160  mm  x 1U0  mm 
Exit  dimension  160  mm  x 220  mm 
Base  Design  #3— Maxwell  30  MW  Unit  (ref.  11) 

JP-U  + Og,  aeeded  with  Cs 

(c  - 8.U5xlO“3li  (mho/m) (atm)°*5(K)"10) 
p°  - 30  atm,  T°  - 3530  K 
^ - 2.2,  B*  - h T 
W - 30  MW,  w - 200  MW/m3 
L ■ 1.3  m 

Inlet  dimension  200  mm  x 200  mm 
Exit  dimension  U50  mm  x U50  mm 
Base  Design  #U— AVCO  VIKING-1  (ref.  12) 

Toluene  + Oj,  seeded  with  Cs 
p°  - 15  atm,  T°  - 3100  K 
Mj^  - 2.2,  B*  - 2.8  T 
W - 2 MW,  w - U7  MW/m3 


Inlet  dimension  50  mm  x 150  ran 
Exit  dimension  166  ran  x 2U9  non 
In  the  calculation,  the  scaled  unit  always  uses  toluene 
seeded  with  Cs,  with  c - 1.68xlO"33  (mho/a) (atm)0#5(K)*10 . 


TABUS  IV-1 

DESIGN  VIA  SCALING  LAWS 


3100  K 


0.8x  ! 3100  K 


10.5  atm 
105  MW/m3 
3.15  MW 
10.8  atm 
78  MW/m3 
6 MW 


1L.1  atm 
1U2  MW/m3 
U.25  MW 
lit. 6 atm 
105  MW/m3 
8 MW 
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nr 


(CONTINUED) 


I 

lx 

3100  K 

2h  atm 

75.5  MW/ra3 

3.2  Mrf 

I 

32.5  atm 

101  MW/ra3 

U. 3 MW 

i 

| 

t » 

i 

0.9x 

3100  K 



22.5  atm 

78  MW/m3 

3.3  MW 

30  atm 

106  MW/m3 

U.5  MW  j. 

f 

i 

1 

i #1 

\ 

1.8x 

3100  K 

' 

l 

1 

31  atm 

68.7  MW/m3 

2 MW 

Ua.7  atm 

92.6  MW/m3 

2.7  MW 

i 

i 

U7.9  atm 

6U.6  atm 

3300  K 

110  MW/m3 

ia8  MW/m3 

i 

i 

3.21  MW 

a. 3 mw 

SECTION  V 
LOSSES 


The  losses  near  the  walls  and  ends,  which  are  totally  ignored  in 
the  foregoing  sections,  will  eventually  be  accomodated  in  the  computer 
simulation  that  must  intercede  between  the  preliminary  design  and  the 
actual  construction.  However,  a few  qualitative  remarks  about  wall  ef- 
fects can  still  be  quoted  here. 

First  of  all,  since  MHD  generation  is  a volume  phenomenon,  while 
wall  losses  are  surface  phenomena,  the  percentage  loss  out  of  the  total 
power  must  be  proportional  to 


(Surface  of  the  generator)  ^ 1 
(Volume  of  the  generator.)  L 


Thus,  a larger  unit  (scaling  up)  will  suffer  less  from  the  wall  losses, 
and  will  have  a larger  efficiency.  (The  same  trend  is  to  be  expected  also 
regading  the  end  losses,  since  the  end  regions  will  occupy  a smaller  per- 
centage for  larger  units.)  But,  as  a penalty,  a larger  unit  will  be  more 
difficult  to  cool  (to  keep  the  wall  material  at  a manageable  temperature 
level)  since  the  (volumetric)  Joule  heating  per  unit  surface  area  (avai- 
lable for  cooling)  will  be  proportional  to  L. 

Secondly,  there  is  the  Rgynolds  number  as  a measure  of  the  vis- 
cous effect,  which  is  completely  ignored  in  the  preliminary  design.  As 
a rule  of  thumb,  the  scaling  should  not  change  the  Reynolds  number  by 
a factor  of  more  than  3*  Obeying  this  rule,  one  is  usually  sure  that 
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the  effects  will  change  only  quantitatively  by  a rather  slight  degree 
(e.g.,  the  frictioi  coefficient  is  roughly  proportional  to  the  one- fifth 
power  of  the  reciprocal  of  the  Reynolds  number,  also  the  disturbance 
due  to  wall  roughness  becomes  less  for  larger  units).  But,  a drastic 
change  in  Reynolds  number  must  always  be  scrutinized  carefully,  for 
fear  that  some  qualitative  deviation  may  evolve;  the  boundary  layer 
may  separate  from  the  wall,  for  instance. 
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cross  sectional  area  of  (the  core  of)  the  generator 
a constant,  see  Eq.  (l£) 
magnetic  field  strength 
peak  magnetic  field  strength 
proportionality  constant  of  ^Jvs.  p 
a proportionality  constant,  see  Eq.  (18) 
specific  heat  capacity  at  constant  pressure 
electric  field  strength 
a function 

electric  current  density 

loading  factor 

length  of  generator 

Mach  number 

an  index 

an  index 

pressure 

combustion  chamber  pressure 
gas  constant 
a parameter,  see  p.  19 
temperature  (Kelvin) 
combustion  temperature  (Kelvin) 
flow  velocity 

electric  power  output 
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w eletric  power  density  (per  unit  volume) 

w effective  electric  power  density 

OiX 

x coordinate  in  the  flow  direction 

y a coordinate  direction,  see  Fig.  III-l 


oi 

P 

7 

P 

<r 

( )* 
( )» 
n 

<>i 


( )2 


(Hall  field)/(l!araday  field) 

Hall  parameter 

ratio  of  specific  heat  capacities 
density 

electric  conductivity 

reference  quantity 

differentiation  with  respect  to  x or  ( x/L ) 

dimensionless  quantity 

at  the  generator  inlet 

x-  component 

y-component 

of  base  design 

of  scaled  (model)  design 


Units: 

atm  (standard)  atmosphere,  a unit  of  pressure 

MW  megawatt 

T tesla  (10^  gauss) 
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ECONOMIC  MODELING  AND  COMPUTER-AIDED 
PROCESS  PLANNING  FOR  SHEET  METAL  OPERATIONS 

Brian  K.  Lambert 
Texas  Tech  University 

ABSTRACT 

A program  for  developing  an  economic  model  and  a computer-aided  process 
planning  system  for  sheet  metal  operations  is  outlined  as  a part  of  the  pro- 
cessing science  research  effort  of  the  Air  Force.  Such  a program  is  intended 
to  enhance  the  integration  of  design,  planning,  and  manufacturing  functions 
into  a cohesive  system.  An  overall  approach  for  developing  a system  that  will 
be  usable  by  designers  for  economic  evaluation  of  material-configuration 
alternatives  and  by  process  planners  for  determining  cost  effective  process 
operations  and  sequences  is  described.  The  economic  model  concept  is  based 
on  establishing  the  relationships  between  part  features  and  the  cost  components 
of  setup,  processing,  handling  and  tooling.  The  computer  aided  process  planning 
system  will  be  based  on  the  use  of  decision  tables  for  determining  the  technolo- 
gical desirability  of  alternative  processing  methods  at  each  stage  of  production. 
Such  alternative  process  plans  will  act  as  input  to  the  economic  model  for 
identifying  cost  optimum  operations  and  sequences  for  aircraft  sheet  metal  parts. 


I INTRODUCTION  AND  BACKGROUND 


A complex  problem  currently  being  encountered  by  many  industries  is  the 
enhancement  of  the  integration  of  design,  planning,  and  manufacturing  functions. 

One  approach  to  this  problem  is  the  utilization  of  the  design/manufacturing 
system  concept  in  which  a computerized  information  flow  and  decision  making 
process  exists  throughout  the  factory. 

. 

The  use  of  computers  as  an  aid  in  solving  specific  design  and  manufacturing 
problems  has  been  in  existence  for  some  time;  however,  relatively  little  effort 
has  been  made  to  integrate  the  various  individual  applications  into  a cohesive 
system.  An  extensive  plan  to  achieve  such  a unified  system  is  the  Air  Force 
Integrated  Computer  Aided  Manufacturing  Program  (ICAM) . The  ICAM  program  is 
not  intended  to  be  a mechanism  for  additional  substitution  of  computers  for 
man-controlled  functions,  but  rather  may  be  described  as  follows: 

"ICAM  is  a system  that  uses  computers  to  organize  every  step  of  manufacturing- 
from  parts  design  to  physical  location  of  machine  tools  to  shipping-  in  the 
most  economical  and  efficient  mode."  (1) 

Although  the  program  has  numerous  objectives,  a conceptual  goal  of  considera- 
ble importance  to  managers  and  engineers  is: 


"A  generative  planning  capability  whereby  a part  designer  could  not  only 
optimally  design  a part,  but  at  the  same  time  subject  this  part  to  a performance 
evaluation,  and  plan  for  the  most  economical  fabrication  of  the  part  within 
the  constraints  of  schedule,  availability  of  raw  materials  and  variability 
of  materials  or  processes.  Further,  it  could  be  envisioned  that  the  fabrication 
test  could  be  performed  immediately  and  the  part  production  automatically  intro- 
duced into  the  overall  manufacturing  plan."  (2) 

In  establishing  the  basic  approach  to  the  ICAM  program,  the  concept  of  a 
manufacturing  "wedge"  was  introduced.  A wedge  is  defined  as,  "A  collection  of 
subsystems  which  on  the  shop  floor  are  specific  to  a particular  process  - but  in 
other  levels  of  manufacturing  - may  be  only  a part  of  general  support,  management 
and  control  subsystems" (3) . The  manufacturing  technology  selected  for  the  initial 
wedge  concept  treatment  was  sheet  metal  processing.  Several  factors  led  to  the 
selection  of  this  particular  group  of  operations,  one  reason  being  that,  unlike 
machining,  the  aerospace  industry  has  invested  only  to  a moderate  degree  in 
advanced  techniques  for  sheet  metal  fabrication  and  improvement  should  rapidly 
result  in  productivity  increases  and  reduced  costs  (4) . 

As  Indicated  by  the  previous  statements  pertaining  to  the  basic  concepts  and 
goals  of  the  ICAM  program,  an  important  and  necessary  aspect  of  the  program  is 
the  development  of  economic  models  for  various  manufacturing  technologies  - the 
first  being  for  sheet  metal  fabrication.  Such  models  will  enable  the  performance 
of  material  - configuration  - cost  tradeoff  studies  during  the  early  portion  of 
the  design  phase  as  well  as  the  economic  evaluation  of  alternative  processing 
methods. 

At  the  present  level  of  manufacturing  planning  technology,  process  planners, 
in  most  cases,  rely  on  their  own  experience  to  select  the  processing  operations 
and  sequence  to  be  utilized.  Similarly,  designers  must  generally  rely  on  experience 
to  evaluate  the  economic  consequences  of  alternative  part  designs.  Thus,  both  the 
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design  and  process  planning  functions  are  "experienced  based"  rather  than  "knowledge 
based".  This  situation  does  not  allow  an  individual  designer  or  planner  ready  access 
to  a firm  and  broad-based  technology  data  base  which  could  form  the  foundation 
for  systematic  evaluation  of  alternatives.  Consequently,  a need  exists  for  a 
planning  capability  for  evaluating  the  economic  aspects  of  design  and  processing 
alternatives.  In  fact,  this  need  is  apparent  in  all  phases  of  the  design  - manu- 
facturing system  and  is  described,  as  follows: 

"As  evidenced  by  the  description  of  stages  that  follows,  the  need  for  the 
concept  of  a component/subassembly/assembly  evaluation  of  economic  alternatives 
is  present  until  final  production:  (1)  Concept/design  stage:  which  of  the 
manufacturing  processes  that  are  capable  of  producing  components  to  the  desired 
specification,  are  low  cost  and  can  be  executed  in  the  available  or  potentially 
available  plant  facilities?  (2)  Process/material  development  stage:  what  are 
the  cost  effective  manufacturing  processes  and  materials  that  can  be  developed 
to  meet  concept  and  design  requirements?  (3)  Cost  estimation/value  analysis/ 
pre-planning  stage:  of  the  available  or  potentially  available  processes,  which 
sequence  will  yield  economical  manufacture  of  a given  batch  or  series  size  of  a 
given  component/subassembly/assembly?  (4)  Detailed  process  planning  stage: 
which  sequence  of  operations  and  which  operating  conditions  for  each  operation 
should  be  chosen  so  as  to  minimize  the  total  cost  of  manufacturing  a given  batch 
or  series  size  of  a given  component/subassembly/assembly?  (5)  Production  schedu- 
ling and  control  stage:  how  should  production  of  a given  variety  of  components 
through  given  production  facilities  be  scheduled  so  as  to  produce  acceptable  parts 
on  time  and  at  a minimum  cost?" (5) 


PURPOSE 


The  purpose  of  this  study  is  to  structure  an  overall  program  which  will  lead 
to  the  development  of  an  economic  model  and  computer-aided  process  planning  system 
for  sheet  metal  operations.  The  model  and  planning  system  will  be  usable  by  de- 
signers for  economic  evaluation  of  material-configuration  alternatives  and  by 
process  planners  for  determining  cost  effective  process  operations  and  sequences. 
In  addition,  the  economic  model  and  process  planning  system  will  be  constructed  in 
such  a manner  to  provide  for  integration  into  the  overall  ICAM  program  as  well 
as  the  Air  Force  processing  science  research  effort. 


REVIEW  OF  PROCESS  ECONOMIC  MODELING 


The  need  and  potential  applications  of  economic  models  of  manufacturing  pro- 
cesses were  described  in  the  previous  section.  The  following  discussion  presents 
a brief  summary  of  the  current  status  of  economic  modeling  at  the  process  level. 

Interest  in  the  economic  analysis  of  manufacturing  processes  has  been  con- 
centrated primarily  on  machining  operations,  beginning  with  F.W.  Taylor’s  work 
in  the  early  1900' s.  During  the  last  20  years  researchers  have  investigated 
numerous  aspects  of  the  machining  economics  problem  with  the  major  emphasis  being 
on  the  determination  of  the  levels  of  the  machining  parameters  (speed,  feed,  depth 
of  cut,  etc.)  to  achieve  some  specified  criterion  such  as  minimum  production  cost, 
maximum  production  rate,  or  maximum  profit  rate.  Investigations  have  also  been 
conducted  concerning  statistical  and  probabilistic  models,  multiple  station  and 
multiple  tool  machining  systems  (5).  Recently,  various  optimizing  techniques  such 
as  response  surface  methodology  (6,7),  geometric  programming  (8),  chance  constrained 


programming  (9),  convex  mathematical  programming  (10)  and  others  have  been  applied 
to  the  problem. 


The  utilization  of  computers  as  an  aid  to  analyzing  and  planning  machining 
operations  has  also  received  considerable  attention  in  the  last  few  years  (11,  12, 

13,  14,  15). 

Research  in  the  area  of  machining  has  been  of  interest  not  only  because  of  the 
economic  significance  of  the  process  but  also  because  of  the  process  parameter  depen- 
dent nature  of  machining.  That  is,  the  cost  of  machining  operations  can  be  directly 
related  to  the  process  parameters  and  thus  determination  of  the  levels  of  the  machining 
variables  to  achieve  minimum  cost  or  maximum  rate  conditions  is  mathematically  feasible. 
This  situation  is  not  necessarily  the  case  for  certain  other  classes  of  metal 
working  processes.  In  sheet  metal  operations,  for  example,  the  actual  process 
variables  such  as  punch  speed  have  relatively  little  effect  on  overall  cost  when 
compared  to  setup,  handling,  and  tooling  costs. 


With  respect  to  the  development  of  basic  principles  and  methodologies  pertaining 
to  process  economic  analysis,  relatively  little  work  has  been  done.  Colding  has 
discussed  some  general  aspects  of  cost  modeling  (16,17)  and  classifies  manufacturing 
costs  into  six  categories:  (1)  methods,  planning  and  scheduling,  (2)  design  and 
manufacture  of  jigs  and  fixtures,  (3)  "pure"  shop  costs,  (4)  material,  (5)  inventory, 
and  (6)  extra  costs. 

A literature  review  revealed  that  little  effort  has  been  directed  towards  develop- 
ing an  economic  model  for  sheet  metal  operations.  As  previously  mentioned,  unlike 
machining,  sheet  metal  operations  are  not  highly  process  parameter  dependent.  That 
is,  no  process  variables  comparable  to  the  machining  variables  of  cutting  speed  and 
feed  have  a dominant  effect  on  total  cost.  Sheet  metal  operations  appear  to  be  more 
dependent  on  part  material,  shape,  and  size  which,  in  turn,  influence  setup,  handling 

and  tooling  costs  and  thus  total  cost. 

■ . 

One  case  study  has  been  performed  in  an  attempt  to  establish  a computer  aided 
work  planning  system  for  simple  sheet  components  (18).  By  inputing  a part  code,  the 
program  reportly  outputs  blank  size,  material  costs,  an  operations  sequence,  setting 
and  cycle  times,  and  nchining  cost.  Only  some  400  parts  of  simple  configurations 
were  considered. 


I i 


II  OVERVIEW  OF  PROPOSED  SYSTEM 


An  initial  step  in  the  development  of  an  economic  model  for  sheet  metal  processing 
is  the  identification  of  the  materials  currently  being  used,  range  of  part  sizes, 
lot  sizes,  and  types  of  processes  utilized.  A preliminary  analysis  has  been  done 
by  the  Boeing  Company  under  the  CAM  Architecture  - Task  III  (Sheet  Metal  Fabrication 
Technology)  effort.  The  major  results  of  this  study  are  summarized  as  follows  (19): 

1.  Material  distribution:  Of  the  8.5  million  sheet  metal  parts  on  file,  the 
material  usage  distribution  was:  aluminum  87.7%,  stainless  steel  6.9%,  carbon 
steel  3.4%,  titanium  1.6%,  nickel  base  and  other  alloys  .4%.  Examination  of 
survey  data  from  other  companies  indicated  a similar  distribution. 

2.  Part  size  distribution:  90  percent,  of  the  parts  considered  are  produced  from 
sheet  less  than  .10  inch  thick  in  blank  sizes  not  exceeding  8 X 25  inches. 

3.  Lot  size  distribution:  90  percent  of  the  lot  sizes  are  160  parts  or  less  with 
most  lots  averaging  20  parts  or  less. 

4.  Process  distribution:  Figure  1 shows  the  breakdown  of  trimming  and  forming 
operations.  The  data  indicates  that  57  percent  of  the  parts  are  trimmed  by  sequential 
shearing  and  blanking  operations  and  23  percent  are  sheared  only.  Thus,  these  two 
processes  account  for  80  percent  of  the  trimming  operations.  For  forming,  57.4  per- 
cent of  the  parts  are  brakeformed  and  27.8  percent  are  hydroformed;  these  two  processes 
constitute  75.2  percent  of  all  the  forming  operations.  Results  of  an  industry  survey 
indicate  substantial  agreement  in  the  relative  utilization  of  forming  processes. 

This  study  did  not  provide  information  concerning  additional  processing  steps 
such  as  cleaning,  surface  treatment,  deburring,  heat  treatment,  etc.,  nor  was  infor- 
mation included  concerning  the  costs  of  the  various  operations. 

5.  Part  shape  distribution:  Analysis  of  part  shapes  indicates  the  following:  one- 
bend  parts  29.3%,  two  or  more  parallel  bends  in  one  direction  22%,  two  or  more  non- 
parallel bends  in  one  direction  10.5%,  two  or  more  parallel  bends  in  two  directions 
8.9%,  two  or  more  non-parallel  bends  in  two  directions  15.5%,  and  curved  bend  line  13.2%. 

Comments  in  the  Boeing  report  further  indicate  the  need  for  an  economic  model 
of  sheet  metal  operations:  "Since  several  different  processes  contribute  to  the  total 
time,  interface  to  a total-part-cost  computer  model  is  necessary.  In  this  model, 
standard  cost  elements  for  a sequence  of  processes  can  be  added  to  give  an  estimate 
of  the  total  part  cost" (19).  The  report  also  states,  "Generally,  there  is  no  means 
for  selecting  processes  based  on  economic  considerations.  Without  such  an  analytical 
capability,  engineering  will  not  be  able  to  effectively  conduct  component  cost  trade 
studies  based  on  formability  data  during  the  design  state,  nor  will  it  be  possible  to 
most  effectively  implement  a generative  planning  capability" (19) . 

To  perform  an  evaluation  of  alternative  part  designs  and  processing  sequences, 
the  basic  system  structure  shown  in  Figure  2 could  be  utilized.  Based  on  design 
requirements  such  as  stiffness,  strength,  weight,  etc.,  the  designer  provides  an 
initial  attempt  at  specifying  the  material  type,  size,  shape  and  other  relevant 
characteristics  which  will  meet  the  functional  requirements  of  the  part.  This  infor- 
mation provides  the  input  to  the  next  portion  of  the  system. 

If  a group  technology  system  is  available,  then  the  alternative  part  design 
under  consideration  can  be  classified  and  coded  and  the  process  plan  for  the  family 
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Figure  1 Process  Distribution 

(Adapted  from  Reference  19) 


Figure  2 Basic  System  Structure 


Design 

Requirements 


■ Stiffness 

■ Strength 

■Fatigue  Properties 
Weight,  etc. 


Alternative 


Design 


• Material 

• Size 

• Shape,  etc. 


System 

Available^ 


Process 

Planning 

System 


Classify  & Code 


Process  Plan 


Material/Process 

Model 


■Flow  Stress 
■Strain  Rate 
-Temperature,  etc. 


Material-Conf iguration 
Process  Alternative 


Economic 

Model 


-Time  Functions 
-Cost  Functions 
-Tooling,  etc. 


•Cost  Optimum  Part  - Process  Alternative 


«< 


fe 

to  which  the  part  belongs  can  be  generated.  If  a group  technology  system  including 
process  plans  for  various  families  is  not  available,  then  a process  planning  system 
must  be  utilized.  Such  a system  will  be  discussed  in  a later  section.  At  any  rate, 
the  alternative  part  design  and  process  plan  act  as  an  input  to  the  material/process 
model  portion  of  the  overall  system. 


The  development  of  the  material  and  process  models  is  currently  underway  at 
Battelle  Laboratories  (Columbus)  and  is  intended  to  provide  analytical  models  des- 
cribing the  material  behavior  under  various  conditions  of  temperature,  strain, 
strain  rate,  etc.  The  model  for  material  behavior  will  describe  the  limits  to 
which  the  material  can  be  deformed  and  the  process  model  will  describe  the  local 
states  of  stresses  and  strains  in  the  material  during  forming  (20) . Essentially 
then,  the  material  model  answers  the  question  of  what  is  the  material  capable  of 
doing  and  the  process  model  answers  the  question  of  what  is  the  process  asking  the 
material  to  do.  The  net  result  of  the  application  of  these  models  is  the  determina- 
tion of  whether  or  not  the  material  and  process  can  achieve  the  desired  part  chara- 
cteristics. At  this  stage,  revisions  to  the  part  design  may  be  necessary  with  out- 
puts from  the  material/process  model  acting  as  feedback  to  the  design  function. 

Once  this  cycle  has  been  completed,  a feasible  material  - configuration  - process 
alternative  will  have  been  identified.  This  information  can  then  be  transmitted 
to  the  economic  model  and  the  cost  of  the  alternative  under  consideration  can  be 
determined.  The  entire  procedure  can  then  be  repeated  for  other  material  - configura- 
tion alternatives  and  the  most  cost  - effective  combination  selected. 
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Ill  ECONOMIC  MODEL  FORMULATION 


In  constructing  an  economic  model  for  sheet  metal  operations  it  is  first  necessary 
to  identify  all  the  processing  steps  required  to  produce  a given  part.  This  includes 
not  only  the  trimming  and  forming  operations  (discussed  in  the  Boeing  Task  III  report) 
but  also  other  processing  stages  such  as  deburring,  straightening,  cleaning,  heat 
treatment,  hand  operations,  etc.  Methods  for  determining  the  processing  steps 
required  for  a given  material  - configuration  will  be  discussed  in  the  section  on 
Process  Planning  Methods.  In  addition  to  specifying  the  operations  to  be  performed, 
the  types  of  costs  to  be  considered  must  be  determined  and  the  relationships  between 
these  various  costs  and  some  set  of  part  characteristics  must  be  established.  Ulti- 
mately, predictive  equations  for  manufacturing  costs  as  a function  of  part  character- 
istics must  be  developed. 

On  a macro  basis,  the  total  cost  for  producing  a manufactured  product  would 
include  direct  materials  and  labor,  factory  overhead,  commercial  expenses,  and 
selling  cost.  These  major  cost  components  are  described  in  Table  1,  and  examples 
of  the  various  elements  which  make  up  these  costs  are  illustrated.  This  study, 
however,  is  concerned  with  an  economic  model  at  the  individual  part  or  process 
level  (what  might  be  termed  a micro-economic  model)  and  will  concentrate  on  material 
costs,  labor  costs,  and  manufacturing  overhead.  The  interpretation  of  these  costs 
for  the  purposes  of  this  study  are  discussed  in  the  following  paragraphs. 

The  direct  material  cost  refers  to  the  cost  of  raw  material  or  semi-finished 
material  directly  traceable  to  an  operation  or  part.  A general  expression  for  the 
direct  material  cost  can  be  formulated  as: 

cm  - Htti  + L2  + L3)Pm  - R 
where  » material  cost/unit 

W = weight  of  a unit  or  in  other  compatible  dimensions  relating  to  price 
L^  = scrap  loss 
L£  = waste  loss 
Lj  = shrinkage  loss 

Pm  = price  per  pound,  length,  volume,  or  area 
R = anticipated  salvage  value  of  material 

Here,  scrap  loss  refers  to  rejection  of  defects  consisting  of  unusable  raw  material 
including  improper  material  layout,  fracture  of  formed  parts,  etc.  Waste  loss  is 
illustrated  by  that  material  remaining  after  parts  have  been  blanked  out  in  stamping 
operations.  Shrinkage  losses  may  occur  in  certain  processes  or  materials  such  as 
wood  and  plastics. 

The  general  expression  for  direct  material  cost  can  be  modified  for  specific 
processes.  For  example,  in  a sheet  metal  stamping  operation  (assuming  no  salvage): 

Cm  * WLtPm(l+e) 

where  W * stock  width 
L ■ blank  length 
t * stock  thickness 
e - expected  losses 


TABLE  1 - TOTAL  COST  COMPONENTS 


Major  Cost 
Component 

Prime  Cost 


Factory  Overhead 


Commercial  Expenses 


Sub-Components 


Direct  Materials 
Direct  Labor 


Indirect  Materials 
Indirect  Labor 
Fixed  & Miscellaneous 


f Distribution 
7 Administrative 


Typical  Elements 

Sheet  Metal,  Forging 
Setup,  Processing 

Supplies,  Lubricants 
Supervision,  Clerks 
Rent,  Utilities 

Advertising,  Samples 
Legal,  Auditing 


Selling  Cost 


Salaries,  Commisions 


The  stock  width  is  determined  by  the  width  of  the  part,  W , plus  the  allowable  margins 
on  each  side  of  the  blank.  It  is  possible  (for  a given  material)  to  express  the 
allowable  margin  as  a function  of  stock  thickness,  say  At.  Then: 

W =■  Wp  + 2At 

The  length  required  is  the  part  length,  Lp,  plus  the  required  distance  between  parts, 
again  expressed  as  a function  of  thickness,  Bt.  Thus: 

L = Lp  + Bt 

The  direct  material  cost  then  becomes: 

Cm  - (Wp  + 2At) (Lp  + Bt)tPm(l+e) 

Indirect  materials  are  those  that  are  necessary  for  operation  in  some  manner 
but  are  not  traceable  to  a specific  operation  or  component.  Cutting  fluids,  lubri- 
cating oils,  clerical  supplies,  etc.  are  examples  of  such  materials.  Because  the 
cost  of  indirect  materials  is  difficult  to  assess  on  the  process  or  product  level, 
the  cost  is  usually  charged  to  a given  operation  by  means  of  overhead  distribution. 
Indirect  labor  costs  are  handled  in  generally  the  same  manner  as  indirect  materials 
cost. 


Direct  labor  may  comprise  one  of  the  most  important  items  of  the  manufacturing 
cost  and  is  defined  as  the  cost  of  actually  producing  goods,  or  the  labor  cost 
directly  traceable  to. a given  part  or  operation.  The  direct  labor  cost  directly 
influences  the  setup  cost,  processing  cost  and  handling  cost  of  an  operation.  One 
means  of  expressing  these  costs  is  as  follows: 

Cpi  ” RpiTpi 
• Csi  * RsiTsi 

Q 

^hi  “ RhiThi 
H 

where  Cpi  = processing  cost  for  operation  i 
Cgi  = setup  cost  for  operation  i 
■ handling  cost  between  stages 
Rpi  * labor  rate  for  operation  i 
Rsi  - setup  rate  for  operation  i 
R^i  * labor  rate  for  handling 
Tpi  * processing  or  run  time  for  operation  i 
Tgi  *>  Setup  time  for  operation  i 
Thi  * handling  time  between  stages 
H = number  of  parts  per  unit  handling  load 
Q - quantity  or  lot  size 

For  convenience,  handling  time  at  a processing  stage  or  machine  is  included  in 
the  run  time,  T .,  and  the  setup  time,  Tg^,  includes  setup  for  a given  operation  as  well 
as  tear  down  to  clear  the  machine  for  the  next  operation. 
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The  manufacturing  overhead  cost  (similar  terms  include:  overhead  expense, 
burden,  indirect  expense,  factory  expense,  etc.)  can  be  defined  as  that  portion 
of  the  cost  which  is  not  clearly  associated  with  particular  operations  and  must 
be  prorated  among  all  the  cost  units  on  some  arbitrary  basis.  Essentially,  manu- 
facturing overhead  includes  all  production  costs  except  direct  materials  and  labor. 
Numerous  methods  exist  to  distribute  manufacturing  overhead  to  jobs.  These  costs 
may  be  applied  on  the  basis  of:  (1)  direct  labor  hours  or  cost,  (2)  direct  material 
cost,  (3)  direct  labor  cost  plus  direct  material  cost,  (4)  unit  of  product, 

(5)  machine  hours,  and  others.  Each  method  has  certain  advantages  and  disadvantages 
and  the  method  utilized  varies  from  company  to  company  and,  in  some  cases,  within 
the  various  sections  in  a company. 


The  determination  of  the  tooling  costs  for  a given  operation  creates  some 
special  problems  depending  on  the  type  of  tooling  required  and  the  magnitude  of  the 
costs  involved.  Considering  its  importance  to  production,  determination  of  tooling 
cos"  calls  for  special  attention  and  is  an  exception  to  the  practice  of  ignoring 
indirect  costs.  One  method  for  expressing  tooling  is  to  consider  it  to  be  composed 
of  two  parts:  A usage  or  depreciation  cost  and  a reconditioning  or  rework  cost. 


The  usage  cost,  C , may  be  expressed  as: 


Ju 


i + Nin2 


where  C, 


Ni 


initial  cost  of  tool 

number  of  parts  made  before  the  tool  must  be  reworked 
number  of  times  the  tool  can  be  reworked  before  discarding 


The  reconditioning  cost,  Cr,  may  be  written  as: 
C_  - Vr 


Ni 


where  Rr  ■ labor  and  overhead  for  tool  rework 
Tf  - time  to  rework  a tool 


The  tooling  cost  for  a given  operation  is  the  sum  of  these  two  components: 


Mr 


1 + NjN2 


Note  that  the  same  type  of  formulation  can  be  used  for  inspection  and  gaging  operations. 


Having  described  the  basic  costs  Involved  on  the  micro  level,  it  is  now  possible 
to  write  a general  expression  for  the  manufacturing  cost  for  a given  part.  The  sit- 
uation is  schematically  shown  in  Figure  3 for  a material  undergoing  n processing 
steps  to  produce  a given  part,  and  the  total  cost  equation  (without  overhead)  is: 


TC  - 

Cm 

♦JjW.l  + Cpl  ' 

TC  - 

Cm 

+ l si^si  . „ 

in 

l-A  Q +R, 

'ti; 


i-0 


'hi 


'oi 


R .T 
ri 


iTrl\ . “ RhiT 
Nii  ) tio  H, 


Pl  Pl  1 + *11*21 
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The  manufacturing  overhead  cost  can  be  added  to  this  expression  depending  on 
the  method  that  is  used.  For  example,  if  overhead  is  based  on  total  direct  labor 
hours,  then  the  overhead  cost  allocated  to  unit  cost  is: 

C0H  * R0H  i»i(Tsi  + Tpi  + Thi  + Tri) 
where  R_„  * overhead  rate  as  a decimal. 


DETERMINING  MODEL  PARAMETERS 


In  order  to  utilize  the  proposed  economic  model  for  evaluating  alternative  part 
designs  or  processing  methods,  relationships  between  part  characteristics  and  cost 
or  time  values  must  be  established.  The  economic  model  parameters  which  must  be 
estimated  include: 

1)  setup  time 

2)  run  or  processing  time 

3)  handling  time 

4)  initial  tooling  cost 

5)  tool  life 

6)  tool  reconditioning  time 

7)  material  requirements  including  scrap  loss 

These  parameters  must  be  related  to  certain  part  characteristics  which  might  include 
the  following: 

1)  material  type 

2)  length,  width,  thickness 

3)  tolerances 

4)  number  of  bends,  radii,  direction 

5)  part  shape 

6)  holes 

7)  joggles,  beads 

8)  surface  condition  requirements 

9)  part  identification  requirements 


This  list  is  by  no  means  complete,  and  other  part  characteristics  which  influence 
manufacturing  cost  will  be  identified  upon  examination  of  information  gathered  from 
industry. 

It  is  anticipated  that  multiple  regression  analysis  will  be  used  to  develop  the 
relationships  between  part  characteristics  and  the  economic  parameters.  Equations 
which  will  predict  the  values  of  the  various  economic  parameters  as  a function  of 
part  characteristics  will  be  developed.  Application  of  such  a method  will  also  allow 
the  determination  of  confidence  intervals  and  ranges  on  manufacturing  time  and  cost 
for  various  alternatives.  In  addition  to  showing  the  main  effects  of  part  characteris- 
tics on  costs,  any  interacting  effects  among  the  variables  can  be  identified. 
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IV  PROCESS  PLANNING  SYSTEM 


As  mentioned  earlier,  utilization  of  an  economic  model  for  sheet  metal  operations 
requires  an  input  which  specifies  a process  plan,  i.e.,  the  operations  and  sequence 
necessary  to  fabricate  the  part.  Consequently,  it  is  logical  to  consider  developing 
a computer  aided  process  planning  system  in  conjunction  with  the  development  of  an 
economic  model.  The  advantages  of  automated  process  planning  systems  are  numerous 
and  the  feasibility  of  such  systems  has  been  well  established  in  the  area  of  machining. 

Two  basic  approaches  for  constructing  an  automated  process  planning  system  exist. 
One  method  is  to  establish  a file  of  coded  parts  and  their  associated  process  plans. 

A new  part  can  then  be  coded,  the  file  searched  for  an  identical  or  similar  part, 

[ ; the  process  plan  retrieved  and  modified  as  necessary.  The  second  method  involves 

generating  a process  plan  from  part  characteristics  in  an  analytical  manner.  This 
method  requires  determining  the  connection  between  part  features  and  processing  opera- 
tions. Actually,  a combination  of  both  methods  would  appear  to  be  the  most  desirable. 

The  overall  problem  may  be  viewed  as  a sequential  decision  process  where  the 
decision  made  at  one  stage  may  have  a significant  influence  on  subsequent  decisions. 

The  selection  of  a particular  metal  may  dictate  certain  processing  steps;  for 
example,  deburring  of  brake  form  blanks  prior  to  forming  may  be  necessary  in  order 
to  prevent  end  cracks  or  end  displacement  depending  on  the  metal  selected.  Part 
size,  shape, tolerances,  finish  requirements,  etc.,  will  influence  the  alternatives 
available  at  each  processing  step. 

Just  as  the  economic  model  will  require  relationships  between  part  characteristics 
and  various  cost  components,  the  process  planning  system  will  need  relationships 
between  part  features  and  processing  steps.  In  fact,  the  data  base  requirements  for 
the  economic  modeling  effort  and  the  process  planning  effort  are  highly  similar, 
which  enhances  the  idea  of  concurrent  development.  By  inputing  certain  information 
about  the  part,  alternative  methods  for  processing  at  each  stage  can  be  generated. 

These  process  plans  can  then  be  evaluated  by  using  the  economic  model  and  the  lowest 
cost  method  identified. 

One  approach  to  be  considered  for  determining  feasible  operations  at  each  stage 
of  processing  would  be  the  use  of  decision  tables.  These  tables  would  indicate  which 
processing  methods  are  potential  candidates  based  on  certain  relevant  part  characteris- 
tics. An  example  of  a decision  table  for  forming  method  versus  material  is  presented 
in  Table  2.  The  numbers  in  the  matrix  could  be  indicators  of  the  relative  technolo- 
gical desirability  of  the  forming  met hod -material  combination.  The  indicators  could 
reflect  such  factors  as  tolerance  control,  effect  on  material  properties,  process 
difficulties,  etc.  Cost  considerations  would  not  be  included  since  the  cost  of  the 
various  alternatives  will  be  determined  using  the  economic  model. 

After  examining  the  material  versus  forming  decision  table,  the  computer  would 
continue  to  the  next  part  parameter  (shape,  size,  etc)  versus  forming  method  and  so 
on  until  all  relevant  part  features  have  been  examined.  At  this  point,  a listing 
of  all  feasible  methods  for  the  processing  step  of  forming  along  with  a technological 
desirability  rating  of  the  methods  would  be  available.  This  procedure  would  continue 
for  each  processing  step  and  alternative  process  plans  would  be  generated.  These 
plans  would  then  act  as  input  to  the  economic  model  for  evaluation.  Thus,  both  a 
technological  desirability  and  cost  evaluation  would  be  conducted. 
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TABLE  2 

DECISION  TABLE:  MATERIAL-FORMING  METHOD 


Forming  Method 


*1 

F2 

F3 

F4 

• • 

Brake- 

form 

Hydro- 

form 

Die 

form 

Roll 

form 

Stretch 

form 

Mi 

Aluminum 

5 

3 

2 

2 

1 

m2 

Stainless 

Steel' 

5 

3 

2 

1 

1 

m3 

Carbon 

Steel 

5 

2 

1 

1 

1 

m4 

Titanium 

5 

2 

1 

0 

1 

M5 

Ni  Base 

5 

2 

1 

0 

1 

V PROGRAM  PLAN 


The  previous  sections  contained  discussions  indicating  the  need  and  potential 
usefulness  of  an  economic  model  and  computer  aided  process  planning  system  for  sheet 
metal  operations.  Certain  conceptual  approaches  to  the  problem  were  also  described. 

The  following  paragraphs  contain  an  outline  of  the  various  tasks  which  will  be  neces- 
sary to  develop  an  automated  process  planning  system  and  an  economic  model. 

1.  Review  of  previous  research.  A continuation  of  the  literature  review 
should  be  undertaken  in  three  major  areas: 

(a)  process  economic  modeling  - general  principles  and  methods  and  cases 
dealing  specifically  with  sheet  metal  operations 

(b)  computer  aided  process  planning  systems 

(c)  technical  information  pertaining  to  alternative  processing  methods  for 
sheet  metal  parts. 

2.  Establish  information  sources.  In  order  to  establish  a data  base  for  develop- 
ing the  economic  model  and  planning  system,  cooperation  with  industry  will  be  necessary. 
Potential  contacts  should  include: 

(a)  aircraft  industry 

(b)  other  industries  which  might  have  useful  information 

(c)  trade  associations  and  professional  societies 

(d)  equipment  and  tool  manufacturers 

• 

3.  Establish  data  base.  The  information  required  to  develop  the  proposed  model  and 
planning  system  should  include: 

(a)  process  plans  for  typical  sheet  metal  parts  showing  part  configuration 
and  processing  steps  from  raw  material  stores  to  finished  part. 

(b)  manufacturing  time  and  cost  data  for  setup  time,  processing  time,  tooling 
requirements,  and  handling  time. 

(c)  raw  material  requirements  and  cost 

(d)  tool  life  and  tool  reconditioning  costs 

4.  Analysis  of  process  plans.  The  processing  plans  for  various  parts  will  be 
analyzed  to: 

(a)  identify  processing  sequences  for  different  materials  and  configurations 

(b)  identify  alternative  processing  methods  at  each  production  stage 

(c)  identify  equipment  types  and  uses. 

5.  Part  characteristic  analysis.  This  task  will  involve  determining  the  part  features 
which  influence: 

(a)  processing  sequence 

(b)  setup  time  and  cost 

(c)  processing  time  and  cost 

(d)  handling  time  and  cost 

(e)  tooling  requirements  and  costs 

6.  Develop  predictive  equations  for  economic  model.  Information  derived  from  tasks 
(4)  and  (5)  will  be  used  to  generate  predictive  equations  for: 
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(a)  time  components 

(b)  cost  components 

7.  Construct  computer  aided  economic  model.  The  predictive  equations  developed  in 
task  (6)  will  form  the  basis  for  the  computerized  economic  model.  Construction  of 
the  computer  model  will  include: 

(a)  overall  program  structure  development 

(b)  specification  of  necessary  inputs 

(c)  development  of  cost  files 

(d)  specification  of  output  formats 

8.  Structure  process  planning  system.  This  task  is  concerned  with  structuring  a 
computer  aided  processing  planning  system  based  on  the  information  gained  in  tasl^s 
(4)  and  (5)  and  will  include: 

(a)  development  of  overall  program  structure 

(b)  identification  of  factors  for  process  decision  tables 

(c)  determination  of  technological  desirability  factors  and  ratings  of 
processing  alternatives. 

(d)  specification  of  input  requirements  in  the  form  of  part  features 

(e)  structuring  of  output  requirements  and  formats 

9.  Integration  with  existing  and  proposed  ICAM  systems.  Throughout  the  the  proposed 
program,  close  coordination  with  the  Air  Force  ICAM  effort  will  be  necessary.  Speci- 
fically, the  economic  model  and  process  planning  system  will  be  integrated  with: 

(a)  process  and  material  models 

(b)  group  technology  systems 

(c)  other  relevant  ICAM  programs 

10.  Testing  and  verification.  This  task  will  consist  of  testing  and  verification  of 
the  economic  model  and  process  planning  system.  Existing  parts  will  be  selected  and 
the  results  of  the  model  and  planning  system  will  be  compared  to  industry  results 
which  have  been  generated  by  other  methods. 

11.  Develop  utilization  plan.  This  effort  will  be  aimed  at  devising  methods  for 
implementing  the  system  in  industry  through  short  courses,  seminars,  published 
reports  and  papers,  etc. 

12.  Program  Assessment.  Feedback  from  system  users  will  be  evaluated  and  any  recom- 
mendations or  necessary  modifications  will  be  incorporated. 
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ABSTRACT 

FEASIBILITY  STUDY  ON  THE  USE  OF  THE  EPR 
TECHNIQUE  ON  DETECTOR  GRADE  SILICON 


The  feasibility  of  employing  the  electron 
paramagnetic  resonance  (EPR)  technique  in 
the  study  of  detector  grade  silicon  has 
been  investigated.  It  was  found  that  data 
can  be  collected  using  an  x-band  spectro- 
meter in  the  absorption  mode  at  room 
temperature  and  at  liquid  nitrogen  temp- 
erature. Also  additional  experimental 
directions  have  been  identified  involving 
K-band  EPR  and  the  dispersion  mode.  Other 
opportunities  exist  in  the  use  of  electron 
nuclear  double  resonance  (ENDOR) . Several 
dopants  and  impurities  were  examined,  and 
future  investigations  were  planned.  Inter- 
action between  the  Participant  and  the 
Research  Colleague  will  continue. 


This  is  a report  of  an  effort  in  the  1978  USAF-ASEE  Summer  Faculty 


Program  at  Wright-Patterson  Air  Force  Base.  The  Research  Colleague  and 
the  Laboratory  to  which  the  Participant  was  assigned  have  as  an  objective 
the  development  of  silicon  based  infrared  detector  materials.  This  work 
requires  knowledge  and  control  of  various  dopants  and  impurities.  The 
Participant  is  experienced  in  electron  paramagnetic  resonance  (EPR) , which  is  a 
tool  with  potential  for  monitoring  such  dopants  and  impurities,  but  one  which 
was  not  used  previously  at  AFML.  The  ten-week  activity  described  in  this  report 
was  directed  at  evaluating  the  feasibility  of  using  this  technique  to  work  on 
this  problem. 

Silicon  is  the  focus  of  the  detector  materials  program.  The  anticipated 

requirement  for  large  detector  arrays  makes  it  desirable  to  apply  the  extensive 

* 

technology  of  semiconductor  devices  to  the  infrared  detector  effort.  The 
general  objective  for  this  work  is  to  develop  detectors  for  the  1 to  25  micrometer 
raaae.  Pure  silicon  is  limited  in  its  response  to  wavelengths  of  1.1  micrometers 
r lees.  In  order  to  extend  this  range,  impurity  atoms  must  be  incorporated 

till  con.  Current  dopants  of  interest  include  indium  doped  silicon  for 
*«r-  «•*  - tie  range  3-5  micrometers  and  gallium  doped  silicon  for  the 

• m * -meter  range.  These  extrinsic  detectors  may  be  used  in  a monolithic 
ircult  technology.^  The  requirement  that  the  detector  array  have 
* enitorm  ability  to  detect  and  respond  from  element  to  element  dictates  impurity 
•ifonuty,  and  the  precise  control  of  defects.  Dopant  concentrations  range 
■ row  10^  to  10^  per  cubic  centimeter,  and  impurity  concentrations  are  as  low 
a«  10*“  per  cubic  centimeter.  This  requires  careful  materials  characterization. 


i 


. 


The  doping  of  silicon  Is  achieved  by  introduction  during  growth,  by 
thermally-controlled  diffusion,  or  by  ion  implantation.  The  electrical  proper- 
ties, the  time  response  of  the  device,  its  operating  temperature,  and  its  optical 
sensitivity  are  affected  by  the  presence  of  impurities.  To  optimize  these 
characteristics  of  the  detector,  careful  measurements  of  optical  and  electrical 
properties  must  be  made  as  a function  of  temperature.  Optical  properties  of 
Interest  include  spectral  absorption,  photoluminescence,  and  photoconductivity. 

The  electrical  transport  properties,  primarily  resistivity  and  Hall  effect,  are 

2 

the  responsibility  of  the  Research  Colleague.  With  these  measurements  various 

detector  materials  are  characterized,  including  the  influence  of  processing 

methods  such  as  thermal  treatment,  ion  implantation,  or  transmutation  doping. 

The  performance  of  silicon  infrared  detectors  is  dependent  on  starting  ’ 

material  quality.  The  degree  of  purity  of  the  crystal  depends  on  the  method 

of  growth.  This  present  work  included  samples  grown  by  the  Czochralski  method 

3 

in  quartz  crucibles  and  by  vacuum  multiple  float  zoning.  Some  examination  of 
the  growing  processesses  has  occurred.  The  float  zoning  method  in  general  is 
more  attractive  because  of  lower  impurity  concentrations. 

Undesired  and  often  unknown  defects  have  a negative  effect  on  the  proper- 
ties of  detectors.  These  often  result  from  crystal  growth  or  detector  fabrication. 
Heat  treatment  during  processing  of  the  detector,  or  annealing  of  ion-implanted 
or  neutron-transmuted  material  may  cause  a redistribution  of  necessary  dopants, 
or  may  Introduce  unknown  and  undesired  centers.  The  EPR  technique  has  been 
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employed  in  the  identification  of  a surface  defect  induced  in  silicon  during 


the  quench  after  thermal  annealing. 


EXPERIMENTAL 


The  laboratory  Involved  does  not  maintain  a magnetic  resonance 
facility.  In  order  to  evaluate  the  EPR  technique  for  use  on  silicon 
detector  materials,  it  was  necessary  for  the  Participant  to  make  use  of  the 
Magnetic  Resonance  Laboratory  in  the  Physics  Department  of  the  University 
of  Dayton,  his  home  institution.  That  facility  now  includes  two  complete 
electron  paramagnetic  resonance  spectrometers,  a nuclear  magnetic  resonance 
spectrometer  and  equipment  for  ENDOR  (electron-nuclear  double  resonance) . 

At  the  beginning  of  the  ten  week  summer  period,  the  second  EPR 
spectrometer  had  just  been  delivered  and  was  not  yet  operational.  Since  it 
has  greater  sensitivity  than  the  original  spectrometer,  it  was  decided  that 
the  more  recently  acquired  device  should  be  used  for  the  silicon  work  since 
weak  signal  levels  were  anticipated.  Initial  activities  included  making 
the  spectrometer  operational  and  performing  some  calibrations. 

The  device  used  in  this  study  is  a modified  form  of  the  Varian  Associ- 
ates model  4500  series  EPR  spectrometer.  EPR  spectroscopy  is  based  on  the 
electron's  intrinsic  angular  momentum  and  its  magnetic  moment.  The  ratio 
of  the  electron's  magnetic  moment  to  its  spin  value,  known  as  the  magnetogyrlc 
ratio,  is  distinct  and  constant.  However,  the  effects  of  the  local  magnetic 
environment  arising  from  the  fields  of  adjacent  nuclei,  and  from  coupling  to 
other  angular  momenta  make  the  effective  magnetogyrlc  ratio  unique  for  each 
physical  environment.  When  the  electron  is  unpaired,  it  is  possible  to  de- 
tect this  ratio  by  inducing  transitions  between  the  electron  Zeeman  levels, 
detecting  these  transition^  and  visually  displaying  them.  In  this  spectrometer 
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the  transitions  are  induced  by  electromagnetic  radiation  in  the  microwave  range. 

i' 

This  X-band  spectrometer  operates  at  about  9.5  Gigahertz,  or  about  9.2  Gigahertz 
with  a quartz  dewar  inserted.  The  external  megnetic  field  for  the  Zeeman 
splitting  is  provided  by  a twelve-inch  regulated  and  water-cooled  magnet 
with  field  scanning  provisions.  The  microwaves  are  produced  by  a klystron, 
are  distributed  by  a microwave  bridge  and  are  detected  by  a crystal  detector. 

The  original  hybrid  tee  bridge  has  been  replaced  by  a more  efficient  three 
port  circulator  arrangement.  The  klystron  is  stabilized  with  automatic 
frequency  control.  The  spectrometer  operates  in  the  absorption  mode. 

In  EPR  spectrometers  the  detector  noise  varies  inversely  with  fre- 
quency, and  therefore  the  higher  the  field  modulation  frequency,  the  better 
the  signal-to-noise  ratio.  Considering  such  factors  as  field  modulation, 
penetration  of  the  cavity  walls,  sample  relaxation  times  and  crystal  noise, 
however,  100  Kilohertz  has  resulted  as  the  optimum  choice  for  most  appli- 
cations. This  spectrometer  has  a 100  Kilohertz  crystal  controlled  oscillator 
which  generates  the  field  modulation  frequency,  and  a high  gain  amplifier 
and  phase  detector  for  detection  of  the  EPR  signal. 

The  manufacturer  states  that  the  sensitivity  of  the  system  is 
”2  x lO^AH  electron  spins  with  a one  second  integration  time,  assuming 
that  the  sample  has  negligible  dielectric  loss."  AH  is  the  linewidth  in 

gauss.  The  sensitivity  was  checked  with  a standard  supplied  by  the  manu- 
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facturer.  The  sample  is  3.3  x 10  % pitch  in  KC£.  It  contains  10  - 25Z 

spins.  Its  linewidth  is  1.7  gauss.  The  observed  signal-to-noise  ratio 

for  this  sample  was  13/1.2,  or  10.8  ratio.  This  suggests  the  possible 

detection  of  a minimum  of  9.3  x 10^  - 25%  spins.  For  this  case,  a 1.7 

gauss  line,  the  specifications  give  3.4  x 10^  spins. 


The  linearity  of  the  magnetic  field  scan,  the  Hall  field  sensing 
and  the  X-Y  recorder  were  checked  using  a known  field  marker.  The  six 
hyperf ine  lines  of  manganese  in  forsterite  has  been  investigated  by  the 
Participant^  using  nuclear  magnetic  resonance.  By  examining  the  sweep  rates 
between  the  five  neighboring  pairs  of  lines,  the  scan  rates  were  measured 
to  be  the  same  within  0.7Z  with  the  variation  random  in  nature.  From  time 
to  time  this  same  marker  has  been  used  as  a field  marker  and  intensity  com- 
parison reference  for  silicon  in  a manner  similar  to  that  employed  earlier 
by  the  Participant  in  work**  on  the  rare  earths  in  the  flourites.  For  the 
new  spectrometer  a series  of  four  of  these  markers  with  varying  intensities 
was  prepared.  The  design  is  such  that  one  marker  can  be  inserted  uniquely 

in  the  back  of  the  cavity  and  remain  unmoved  during  the  running  of  a series 
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of  silicon  samples.  Several  good  experimental  references  * * ’ were 
consulted  in  connection  with  these  experimental  efforts. 

Silicon  samples  were  run  at  X-band  at  room  temperature  and  at  nitrogen 
temperature  (77  K)  in  an  inserted  quartz  dewar.  Helium  temperature  (4.2  K) 
dewars  are  available  as  is  a Heli-tran  liquid  transfer  refrigerator  for 
variable  temperature  control  from  helium  to  room  temperature.  This  system 
has  not  been  employed  at  this  point  in  time. 

During  the  ten  weeks  reported  here  the  system  has  been  put  into  oper- 
ation at  77  K and  300  K,  has  been  calibrated,  and  a number  of  silicon  samples 
have  been  run  as  described  in  the  following.  A variety  of  topics  were  in- 
vestigated with  an  eye  toward  application  of  EPR  technique.  Measurements 
were  attempted  in  connection  with  some  of  them.  The  topics  will  be  discussed 
in  the  following  sections. 
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DANGLING  BONDS 


A simple  model  of  a (111)  surface  of  a diamond-structure  covalent 
semiconductor  suggests  one  cut  bond  per  surface  atom.  This  leads  to  the 
expectation  that  each  such  electron  would  produce  a "dangling  bond". 

These  electrons,  if  they  remain  largely  unpaired,  should  be  detectable 
by  EPR  measurements.  EPR  signals  have  been  found  from  silicon  surface 
regions. ^ There  has  been  interest  in  determining  whether  this  model  is 
correct  or  whether  the  atomic  rearrangements  known  to  occur  on  most  semi- 
conductor surfaces  cause  major  modifications.  The  dangling  bond  concept 
has  been  the  simple  starting  point  for  many  theoretical  discussions  and 
could  not  be  seriously  tested  while  surface  structures  remained  uncertain. 

Recently,  theoretical  computations  of  surface  states  have  been  carried  out 
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by  Schluter  and  co-workers  for  the  surface  model  of  Haneman.  This  work 

concluded  that  paired  electrons  exist  on  alternate  sites,  which  would  yield 

14  15 

only  a very  weak  EPR  signal.  Recent  EPR  measurements  * have  shown  that 

there  is  indeed  a negligible  EPR  signal  from  well-cleaved  silicon  surfaces. 

In  a February  1978  article, ^it  was  reported  that  the  above-mentioned 

signal  is  observed  in  less  well-cleaved  surfaces  and  that  the  origin  is  now 

believed  to  be  due  to  localized  states  on  microcrack  surfaces. 

Much  data  has  shown  an  EPR  signal  to  be  present  in  the  surface  region 

of  silicon.  The  results  to  1974  have  been  reviewed. ^ In  summary,  an  EPR 

signal  at  g*2.0055  of  width  6 to  7 gauss,  has  been  found  at  room  temperature 

from  silicon  surfaces  that  have  been  abraded,  produced  by  crushing^,  by 
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cleavage,  irradiated  with  neutrons,  heated  in  vacuum  and  quenched  , and 

19  20 

from  amorphous  films.  ’ 


K, 


Recent  work  showed  that  a well-cleaved  surface  has  less  than  one 


spin  per  one  hundred  atoms.  The  fact  that  the  spin  density  from  high 


quality  cleaved  surfaces  can  be  very  much  lower  than  for  poorer  surfaces 


points  to  an  origin  associated  with  cleavage  rather  then  with  the  flat 


surface.  The  evidence  has  led  to  a conclusion  that,  since  the  GPR 


signal  that  appears  when  silicon  is  crushed,  cleaved,  or  abraded  is  pro- 


portional to  the  areas  of  microcracks  induced,  the  origin  of  the  signal 


may  be  localized  states  on  the  surfaces  of  the  microcracks. 


In  the  present  work  a number  of  attempts  have  been  made  to  see 


so-called  dangling  bonds.  Several  likely  samples  were  examined  at  77  K 


and  300  K.  The  usual  surface  bonds  are  tied  up  with  silicon  dioxide. 


Etching  treatments  have  been  used  in  an  attempt  to  free  surface  bonds. 


The  procedure  used  included  thirty  seconds  in  HF  (49 %)  followed  by  thirty 


seconds  in  HNO^  (65%)  : HF  (49%)  : CH^COOH  “ 3:2:2  and  rinsing  with  deionized 


water.  The  prescribed  treatment  called  for  15  minutes  in  the  combined 


solution  but  an  experience  with  a 28  milligram  sample  which  vanished  in  four 


minutes,  reduced  the  time  to  thirty  seconds  in  this  version. 


No  detectable  resonance  was  observed  in  any  of  these  attempts.  In 


one  attempt  the  sample  was  immersed  in  liquid  nitrogen  immediately  after 


etching  with  the  same  result. 


RADIATION  DAMAGE 


EPR  has  been  a useful  experimental  technique  for  the  study  of  the 


structure  and  kinetics  of  defects  in  solids.  Although  restricted  to  centers 


that  are  paramagnetic,  under  certain  conditions  it  is  possible  to  convert 
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a defect  from  the  diamagnetic  to  the  paramagnetic  state.  Semiconductors 

are  especially  versatile  in  this  respect  since  the  initial  position  of  the 

Fermi  level  is  dependent  upon  the  concentration  of  n-  or  p-  type  dopants. 

In  silicon, irradiation  produces  deep,  localized  levels  which  compensate  the  shallow 

donor  and  acceptor  levels  and  consequently  pull  the  Fermi  level  towards 

the  middle  of  the  bandgap.  As  this  occurs  many  of  the  defects  alternate 

between  diamagnetic  and  paramagnetic  states. 

EPR  studies  in  silicon  damage  have  been  particularly  successful,  as 

22  23  24  25  26 

demonstrated  by  the  early  series  of  papers  » » ♦ » by  Watkins  and 
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co-workers.  Various  reviews  ’ * of  the  sub j ect  have  appeared  over  the 

years.  An  example  of  a few  of  the  many  silicon  centers  observed  is  presented 
in  the  following  table: 


* 

Center 

Si-E 

Si-A(Si-Bl) 

Si-N 

Si-J(Si-G5) 

Si-C(Si-G7) 


Host 

Float  zone 
Pulled 

Pulled 

Pulled 


Doping  Particle 

n-type;  P electrons 

n-type;  P electrons 

fast  neutrons 

p-type;  B,Al,Ga,In  electrons 
n-type;  P electrons 


Energy 
1 . 5MeV 
1 . 5Me V 

Pile  spectrum 
1.5MeV 
1 . 5MeV 


Note:  At  least  two  sets  of  nomenclature  are  used  in  the  literature  of  irradia- 
tion induced  defect  centers. 

At  this  point  in  time,  no  irradiated  silicon  samples  have  been  investi- 
gated in  this  experimental  effort. 

It  is  useful  to  reflect  on  the  feasibility  of  attempting  to  examine 
radiation  damaged  silicon  with  the  present  spectrometer.  A number  of  authors 
have  reported*-  damage  on  x-band  (9.1  to  9.5  Gigahertz)  spectro- 
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meters  at  room  temperature  and  liquid  nitrogen  temperature.  However,  others 

have  found  it  advisable  to  use  other  experimental  conditions  for  studies  of 

31  32  33  34 

radiation  damaged  silicon.  .For  example,  a number  of  experimenters  ’ ’ ’ 

have  gone  to  K-band  (19  to  20  Gigahertz)  spectrometers.  Also  work  on  damaged 
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silicon  is  often  done  by  using  an  ENDOR  spectrometer.  ’ * 


ION  IMPLANTED  SILICON 


Ion  implantation  is  a useful  technique  for  introducing  dopants  into 
the  silicon  crystal.  In  the  process,  ions  of  a particular  impurity  element, 
such  as  boron  or  phosphorus,  are  shot  from  an  accelerator  into  a crystal 
such  as  silicon.  The  choice  of  the  injecting  energy  determines  the  depth 
to  which  the  silicon  is  doped.  However,  two  major  problems  limit  the  de- 
sirability of  ion  implantation.  The  bombardment  with  high-energy  ions  dis- 
rupts the  orderly  arrangement  of  the  crystal  lattice  giving  much  surface 
damage.  In  addition  the  ions  do  not  all  settle  in  substitutional  lattice 

sites  to  become  donors  or  acceptors.  This  damage  lends  itself  to  study  by 

38  + 

EPR.  The  first  such  report  concerned  0 ions  implanted  in  Si:A£  and  Si:B 
crystals.  In  this  initial  report  a layer  of  less  than  15,000  X was  reported 
due  to  300  KeV  0+-ions.  The  so-called  S1-P3  center  is  the  dominent  para- 
magnetic defect  produced  at  room  temperature  in  both  samples,  and  it  anneals 

below  200°C.  The  Si-Pi  center  dominates  above  200°C  to  near  350°C^  where  it 

39 

anneals.  Additional  EPR  investigations  led  to  depth  distributions  , other 

40  41  42  43 

implanted  ions,  * and  other  defect  centers.  ’ Another  worker  has 

44  + 

reported  that  1 MeV  0 -ions  implanted  in  silicon  produces  amorphous  layers 
which  recrystallize  after  1000°C  anneals. 


A 


45 

A more  recent  study  of  Interest  in  the  current  investigation  con- 
cerns an  EPR  investigation  into  phosphorus  implanted  silicon.  In  the  in- 
vestigation, an  x-band  spectrometer  was  used  at  77°K.  It  was  reported 

that  P-ion  implantation  reduced  the  g-factor  (magnetogyric  ratio) . An 

13  16  —2 

increase  in  ion  dose  from  6 x 10  to  2 x 10  cm  reduced  monotonically 
the  g-factor  from  1.9990  to  1.9980  for  samples  annealed  at  1150°C  and  from 
1.9990  to  1.9970  for  those  annealed  at  1070°C.  In  each  case  the  EPR  signal 
of  the  conduction  electrons  was  isotropic  and  the  profile  was  Lorentzian. 

The  line  widths  increased  from  2 gauss  to  12  gauss  (1150°C)  or  30  gauss 
(1070°C) . Successive  removal  of  thin  silicon  layers  resulted  in  reverse 
changes;  the  g-factor  gradually  returned  to  1.9990  and  the  width  to  2 gauss. 
These  results  were  fit  to  an  analytical  expression. 

This  brief  sketch  gives  an  indication  of  the  possibilities  that 
exist  with  the  EPR  technique.  To  this  point  in  time  the  Participant  has 
not  had  access  to  ion  implanted  samples  for  investigation  with  the  EPR 
technique. 


NEUTRON  TRANSMUTATION  DOPING 

12  -3 

Boron  impurities  of  the  order  of  10  cm  are  present  in  the  purest 
of  available  silicon.  This  shallow  acceptor  has  several  negative  effects 
on  the  operation  of  gallium  or  indium  doped  silicon.  The  boron  ionization 
energy  is  less  than  those  of  the  two  dopants  and  the  boron  levels  thermally 
ionize  at  lower  temperature.  This  requires  additional  detector  cooling 
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which  increases  weight  and  cost.  This  difficulty  can  be  eliminated  if  the  boron 
levels  can  be  populated  by  carriers  from  compensating  donor  levels.  Nearly 
exact  compensation  of  these  boron  acceptors  by  the  addition  of  shallow  donors 
can  eliminate  the  difficulty.  However,  greatly  overcompensating  the  boron 
is  undesirable  since  this  would  start  to  fill  the  gallium  or  indium  levels. 
Overcompensation  also  reduces  carrier  mobility  carrier  lifetime,  and  detector 
gain. 

A suitable  shallow  donor  is  phosphorus,  31P.  It  can  be  doped  into 
silicon  by  neutron  transmutation  doping.  The  process  is  based  on  converting 

3^S1  to  3^Si  by  an  (n,  y)  reaction  via  thermal  neutrons  from  a reactor.  The 

31  31 

Si  decays  by  negative  beta-decay  to  P,  the  impurity  sought.  The  donors  . 

produced  are  distributed  uniformly  throughout  the  lattice.  Also  the  rate 
of  production  of  dopant  can  be  carefully  controlled. 

These  two  major  advantages  of  neutron  transmutation  doping  are  de- 
flated somewhat  by  the  radiation  damage  effects  that  occur  during  the  radi- 
A6 

ation.  As  mentioned  in  earlier  sections,  the  EPR  technique  has  potential 

for  monitoring  the  damage  and  its  repair.  Several  EPR  studies  of  neutron- 

47  48  49 

irradiated  silicon  have  been  reported.  ’ ’ The  EPR  spectra  of  phosphorous 

dopants  can  be  compared  to  other  work3^’^  on  Si:P,  giving  opportunities  for 
Interpretation . 

Several  pieces  of  neutron  transmutation  doped  silicon  have  been  ex- 
amined in  this  effort.  At  300K  and  77K  EPR  spectra  were  observed.  An 
example  of  the  absorption  derivative  trace  is  displayed  in  Figure  1,  on 
the  next  page.  In  the  trace  one  can  see  a large  line  and  much  structure. 
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LASER  ANNEALING 


> 


The  damage  mentioned  earlier  under  ion  implantation  and  neutron 
transmutation  doping  is  often  repaired  by  an  annealing  step.  Conventional 
practice  involves  placing  the  sample  in  a high  temperature  furnace,  perhaps 
under  vacuum,  for  times  on  the  order  of  an  hour.  This  thermal  annealing 
repairs  much  lattice  damage  but  not  all.  In  addition  it  often  degrades 
electrical  properties. 

The  need  to  heat  only  the  damaged  region  of  the  sample  led  to  laser 
annealing.  A laser  provides  a strong,  highly  controllable  source  of  energy 

CO  CO  CL 

with  the  additional  advantage  of  being  monochromatic.  The  discovery  ’ * 

that  pulsed  laser  irradiation  of  damaged  crystals  is  effective  in  removing 
defects  stimulated  additional  experimentation^ It  has  been  demon- 
strated^ that  overlapping  laser  pulse  spots  can  be  used  to  produce 

single-crystalline  annealed  areas,  and  that  the  laser  power  may  be  varied 

59 

to  achieve  dopant  depth  profile  control.  It  has  also  been  reported  that 
periodic  property  variations  are  produced  in  the  annealed  material.  In 
addition  to  pulsed  laser  annealing,  continuous  wave  laser  annealing  has 
also  been  reported.  More  recent  reports  offer  other  advan- 

tages to  laser  annealing. 

During  the  ten  week  period  of  this  project  consideration  was  given 

I 1 

to  the  feasibility  of  attempting  an  EPR  monitor  of  the  laser  annealing 

I 

process.  While  the  approach  is  very  desirable,  it  was  decided  that  the 

short  time  available  made  such  an  attempt  by  the  Participant  unwise.  In  * 

* I ! 

addition  access  to  laser  annealed  samples  has  not  been  possible.  However 

1 I 

I j 

. -jH 
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laser  annealing  continues  to  appear  very  attractive  because  of  the  potential 
for  defect-free  regrowth  and  because  of  no  loss  of  dopant  during  the 
treatment . 

Si: In  X-LEVEL 

The  study  of  indium-doped  silicon  for  use  as  a detector  in  the 
3-5  micrometer  range  has  led  to  an  interesting  puzzle,  the  identity  of 
the  "X-level."  A new  acceptor  level  located  at  0.111  ± 0.002  eV  from 
the  valence  band  has  been  observed^  in  indium-doped  silicon.  It  is  be- 
lieved to  be  in  all  Si: In  samples  except  when  masked  by  overcompensation.  The 
existence  of  this  level  with  an  ionization  energy  significantly  less  than 
that  of  In,  0.156  eV,  reduces  the  maximum  temperature  for  background- 
limited  performance.  The  usual  method  of  eliminating  the  effect  on  low 
temperature  behavior  by  overcompensating  the  shallow  impurity  center  is 
undesirable  here  due  to  the  degradation  in  photoconductive  properties 

that  would  result.  Identification  of  and  control  of  this  defect  is 

67  68 

obviously  needed.  This  has  resulted  in  study  ’ of  the  center,  but 

thus  far  without  identification.  Comparisons  can  be  made  to  earlier  work^’^® 

71  72  73  74 

and  to  more  recent  studies  * ’ ’ on  the  system.  The  most  promising 

suggestions  for  explanations^’ ^ have  been  that  an  indium  complex  is 
responsible. 

During  the  ten-week  summer  effort  several  attempts  were  made  to 
observe  an  EPR  spectrum  in  indium-doped  silicon.  The  main  results  are 
summarized  in  the  table  on  the  following  page. 
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SUMMARY  OF  INDIUM-DOPED  SILICON  RESULTS 


Crystal  ID 

In  cone. 

X cone. 

Mass 

Trace 

Observation 

(cm-3) 

(cm-3) 

(mg) 

DC  002  #18 

2.72  X 1017 

3.32  X 1012 

28.56 

GKM 

7-25-78-3 

Weak  peak  near  g*2 

GZ-163-26 

2.04  X 1017 

O(undetected) 

16.90 

GKM 

7-13-78-2 

Weak  peak  near  g«*2 

GKM 

7-25-78-6 

AFML-001-005 

2.5  X 1016 

O(undetected) 

21.59 

GKM 

7-31-78-11 

No  spectra  77K  or  300K 

GKM 

7-31-78-15 

% 

AFML-001-011 

69.10 

GKM 

8-7-78-8 

No  spectra  77K  or  300K 

GKM 

8-7-78-9 

AFML-001-011 

15.27 

GKM 

8-21-78-1 

No  spectra  300K 

GZ-185  #8 

NONE  (Gallium  impurity) 

77.92 

GKM 

7-6-78-6 

Strong  peak  near  g=2 

GKM 

7-6-78-8 

at  77K  and  300K 

GKM 

7-24-78-6 

GKM 

7-25-78-4 

The  peak  near  g-2  is  observed  in  some  of  the  Si: In  samples 
but  not  in  others.  It  is  curiously  like  a stronger  peak 
observed  in  a Si:Ga  sample,  listed  last  in  the  table.  This 
spectrum  is  given  in  Figure  2 on  the  next  page.  The 
source  of  this  peak  has  not  been  identified. 


FUTURE  DIRECTIONS 


Since  the  literature  search  and  the  experimental  investigations  conducted 
during  the  summer  show  promise  for  the  use  of  the  EPR  tool  in  the  study  of  detector 
material  silicon  by  AFML,  it  has  been  established  that  the  l'articipant  and 
the  Research  Colleague  would  like  to  continue  interaction  during  the  coming 
year.  It  is  anticipated  that  the  Participant  will  expend  effort  on 
the  project  in  the  coming  months.  The  Research  Colleague  will  supply  appropriate 
samples  for  study  and  will  interact  with  the  Participant  on  any  results.  The 
facilities  of  the  Magnetic  Resonance  Laboratory  in  the  Physics  Department  at  the 
University  of  Dayton  will  be  used  in  the  further  investigations. 

In  the  continuation  of  the  investigation,  one  effort  will  be  with  dangling 
bonds.  Further  variations  in  the  wide  variety  of  sample  preparation  treatments  ' 
possible  will  be  attempted.  Additional  experimental  configurations  will  be  in- 
vestigated, as  discussed  later. 

Additional  study  of  the  neutron  transmutation  doped  samples  will  be  conducted. 
With  a greater  variety  of  samples,  it  should  be  possible  to  unravel  the  spectrum 
observed.  This  is  observable,  of  course,  with  the  present  experimental  configuration. 

Due  to  the  importance  of  understanding  the  "x-level"  in  indium-doped  silicon, 
continued  investigations  will  be  made  on  that  system.  A series  of  experiments  at 
AFML  conducted  by  Dr.  V.  Swaminathan  should  produce  a variety  of  Si:In  samples  in 
the  near  future.  These  samples  should  have  various  concentrations  of  indium  and 
"x-level".  It  is  anticipated  that  these  samples  will  be  investigated  after  a range 
of  heat  treatments. 

Any  other  appropriate  samples  that  become  available  will  be  examined.  For 
example,  ion  Implanted  samples  would  be  interesting.  In  addition,  laser  annealed 
samples  such  as  those  under  study  at  AFIT  would  provide  a useful  investigation. 


Electron  Spin  Resonance  Spectra  of  Donors  in  Silicon 
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In  future  investigations,  the  experimental  configuration  will  be 


altered.  The  need  is  demonstrated  in  the  table  taken  from  Reference  8 and 

] 

presented  on  the  next  page.  It  is  a typical  summary  of  EPR  centers  seen 
in  semiconductors,  mostly  silicon.  Examination  of  this  table  immediately 
leads  to  the  observation  that  most  of  the  work  is  done  in  the  dispersion 
mode  or  using  the  ENDOR  technique.  The  work  of  this  summer  was  done  only 
in  the  absorption  mode. 

Long  spin-lattice  relaxation  times  and  low  defect  concentrations  place 

* 

stringent  conditions  on  the  experimental  techniques  required  to  observe 

79 

defects  in  silicon  with  EPR.  As  a result  operating  conditions  and  sensitivity 
are  crucial  factors  in  the  success  of  EPR  measurements.  Because  the  spin-lattice 

! 

relaxation  times  of  paramagnetic  centers  in  silicon  are  very  long  (as  least 

one  second)  at  low  temperatures,  it  is  usually  not  possible  to  observe  any  signal 

below  ~30K  in  the  absorption  mode  because  the  signal  is  completely  saturated 

even  at  the  lowest  microwave  power  levels.  Many  of  the  defects  in  silicon 
22—26 

are  observed  with  EPR  under  the  following  conditions: 

1)  Temperature  near  20K 

2)  Dispersion  mode 

3)  Power  of  the  order  of  a microwatt 

4)  Modulation  frequency  less  than  one  kilohertz 

5)  K-band  microwaves,  and 

6)  A cylindrical  cavity. 


— ’ «* 


Table  7.2-2 

Examples  or  Spin  Centers  and  EPR  Parameters  Studied  in  Semiconductors 
Donor  or  acceptor  Spin  center  or  impurity 


Host 

Concentra- 

Concentration 

Observed 

crystal 

Element 

tion 

Treatment 

Studied  per  cm* 

Temp.  *K 

parameters 

Mode 

Ref. 

Silken 

P 

IO“-IO“ 

Strained. 

• 

Si-4  center  — 10“ 

20  and  77 

t.  hfs,  r 

Dispersion 

W-7 

irradiated  O) 

Silkon 

Sb.  P.  As 

10“ 

Strained 

Donors 

1.23 

Ay,  7a 

Dispersion 

W-22 

Silicon 

P 

10“-I0“ 

Strained, 

Si-£  center 

4.2-133 

hfs,  |.  r 

Dispersion 

W-5 

irradiated  (*) 

and  ENDOR 

Silkon 

B.  Al.  Ga,  I 

Strained, 

Si-C6 

40-110 

hfs,  g,  r 

Dispersion 

W-4 

irradiated  (t) 

Si-G7 

Silkon 

B.P 

-2  x 10“ 

Irradiated  (e) 

Si-G6  formerly  — 10“ 

20.4 

hfs,  g 

Dispersion 

C-22 

(B) 

~2  x 10“ 

Si-4 

to  J x 10“ 

<P> 

Silkon 

B.P 

~I0“ 

Transition 

V»*Cr*Mn*  I0“-I0“ 

1.3  and  20.4 

hfs,  g 

ENDOR 

W-21 

metals  diffused 
in  at  I2X)°C 

Mna*  Fe* 

% 

Silicon 

Metals  diffused 

Pd  and  Pt  1-4  x 10“ 

Up  to  20 

hfs,  t 

ENDOR 

W-26 

in  at  I300*C 

Silkon 

P.  As,  Sb 

5 x 10«- 

Strained 

4.2 

hfs,/ 

Dispersion 

J-4  F-12, 

6 x 10“ 

F-13 

Silkon 

P 

3 x 10*- 

12  to  4 

hfs,  r 

F-2.  F-4, 

10“ 

F-J 

Silkon 

“•Sb,  ‘"Sb 

5 x 10* 

1.2 

hfs,f 

ENDOR 

E-6 

Silkon 

B.  As 

10“  As. 

Neutron 

N(ii,  iii)  ix.  a.  n 

300,  77.  4.2 

hfs.  g 

Absorption 

J-9 

residual  B 

irradiated. 

(V.  VI).  (VII. 

I0“-I0« 

NVT* 

VIII) 

• 

Silkon 

P 

« x 10“  and 

1.3 

hfs 

ENDOR 

J-4 

• x 10“ 

Gcr- 

P.  As 

I x I0“- 

1.3 

hfs,  A/f.  t 

F-« 

minium 

J x 10“ 

SiC 

S.  N,  B.  Ni  5 x I0“-I0“ 

7S 

t.  AN.  hfs 

W-19 

SiC 

B 3 x io*n 

N 3 x 10“/ 

14  and  20.4 

hfs.  t 

ENDOR 

L-15 

W-27 

*NVT  ■ neutrons  per  cm*. 


Table  taken  from  Reference  8.  The  references  indicated  in  the  table 
are  specified  in  that  text  and  are  not  reproduced  here. 
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Under  these  conditions  the  spectra  are  usually  observed  in  fast 
passage.  Usually  low  temperatures  are  required  because  of  the  need  for 
sensitivity.  Unless  the  sample  is  intrinsic,  the  lower  temperatures  are 
also  required  in  order  to  freeze  out  the  carriers  so  that  loading  of  the 
cavity  is  minimized.  At  these  low  temperatures  even  the  dispersion  signal 
begins  to  saturate  for  higher  power  levels.  For  magnetic  field  modulation 
frequencies  larger  than  one  kilohertz,  the  lines  observed  in  fast  passage 
behave  as  though  they  are  partially  saturated.  K-band  frequencies  are 
advantageous  over  X-band  because  of  the  added  resolution  one  gets  in  the 
Zeeman  spectrum.  The  cylindrical  cavity  has  a higher  Q than  the  regular  » 
rectangular  cavity. 

In  the  coming  months  attempts  will  be  made  to  alter  the  spectrometer 
in  this  fashion  for  use  in  the  detection  of  other  paramagnetic  centers  in  silicon. 
This  should  make  possible  new  observations  in  the  samples  described  above. 

One  of  the  early  reports  of  use  of  the  ENDOR  technique  was  made  by 
80 

Feher.  He  used  the  approach  to  investigate  shallow  donors  Sb,  P,  and  As. 

In  addition  he  studied  chemical  impurities  Bi,  Li,  and  Fe,  as  well  as 
centers  associated  with  the  surface  of  the  sample  and  with  the  heat  treat- 
ment of  silicon.  Finally  he  also  examined  the  influence  of  substitutional 
germanium  atoms  on  the  resonance  line  in  Si:P  samples.  As  time  permits  this 
ENDOR  technique  will  be  explored. 
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The  work  has  established  a metamodel  of  the  simulation  modeling 
process.  The  approach  utilizes  structured  analysis  to  form  a 
hierarchical  cell  model  which  describes  the  functional  decompo- 
sition of  the  modeling  process  and  data  flows. 
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INTRODUCTION 


This  report  contains  a metamodel  of  the  simulation  modeling  process. 

The  metamodel  is  described  from  the  viewpoint  of  a simulation  modeler  and 
as  such  reflects  the  model  design  development  and  implementation  process. 

A structured  analysis  approach  (See  reference  (R0S77)  is  used  to  represent  a 
hierarchical  model  of  the  process  of  developing  a model  design,  implementing 
the  model  design  as  a simulation  program,  and  performing  subsequent  analysis. 
The  metamodel  is  taken  to  two  levels  completely  with  several  blocks  out  into 
lower  levels. 


A commentary  is  presented  discussing  various  aspects  of  the  model  and  a 
tree  structure  discussed  to  help  understand  how  the  metamodel 's  hierarchy  is 
broken  out  into  its  components.  The  metamodel  is  then  presented. 

MODEL  COMMENTARY 

A hierarchy  implies  levels  with  priorities  associated  with  each  level. 

Thus,  those  activities  specified  at  high  levels  should  be  considered  to 
have  a higher  priority  or  level  of  importance  than  those  activities  at  a 
lower  level.  This  assumption  is  used  to  specify  activity  blocks  within 
a given  level  and  is  based  upon  prevalent  attitudes  from  the  literature 
(see  bibliography)  and  the  author's  own  experience. 

The  highest  level  is  the  A-0  diagram  which  identifies  the  model, 
principle  inputs,  controls,  mechanisms  and  output.  Thus,  we  have  specified 
the  simulation  modeling  process  as  separate  from  its  environment  except  for 
the  inputs,  controls,  mechanisms,  and  outputs  identified.  This  assumption  is 
not  completely  accurate  but  it  is  necessary  in  order  to  allow  the  creation  of 
a managable  model. 

The  entities  in  A-0  are  interfaces  between  the  system  (i.e.  the  simulation 
modeling  process)  and  its  environment.  These  then  are  primary  external  entities 
of  the  system  and  act  as  primary  drivers  of  the  system.  These  entities  are 
selected  as  the  primary  entities  in  which  the  inputs  and  outputs  are  casually 
related  through  a mechanism  constrained  by  controls. 

The  first  level  is  the  AO  diagram  which  identifies  four  principal 
activites,  problem  formulation,  preliminary  model  formulating,  analysis 
tool  development  and  the  actual  analysis.  The  second  level  is  a breakout 
of  each  block  in  AO.  Finally,  blocks  A33  and  A34  in  diagrams  A3  and  A4  are 
broken  out  to  show  simulation  program  creation  and  simulation  model  validation. 
The  model  hierarchy  is  presented  in  a tree  diagram  (see  Figure  1). 
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GLOSSARY  FOR  DIAGRAM  A-0 


jropriate  Personnel 


At  any  point  in  the  modeling  process  there  are  two  types  of  personnel 
associated  with  a project.  The  modelers  are  personnel  with  experience  in  modeling 
and  provide  continuity  as  primary  drivers  of  the  process.  At  each  functional 
level  and  step  various  personnel  interact  with  the  project  and  modelers  to 
provide  insight,  critique,  etc.  These  are  secondary  drivers  of  the  process  and 
are  necessary  for  the  primary  drivers  to  perform  an  adequate  job. 

Complete  Problem  Analysis 

The  final  product  which  includes  a verified,  validated  simulation  model, 
complete  statistical  analysis  of  results,  and  conclusions  based  upon  the  model  and 
results,  all  presented  in  a well  documented  report. 

Contract  Specifications 

This  refers  to  contractual  agreements  which  permit  the  modeling  to  begin  and  pro- 
vide an  upper  bound  based  upon  costs,  and  may  include  personnel,  facilities,  time, 
etc. 


Management  Perspective 


The  point  of  view  expressed  by  management  or  the  "client"  about  the  problem, 
objectives,  boundaries,  etc.  Essentially  a management  problem  perspective.  It  may 
or  may  not  be  relevant  but  must  be  considered. 

Physical  System  Data 

Data  from  the  system  being  modeled.  This  may  be  in  many  forms  including 
numerical,  diagramatic,  verbal,  reports,  etc. 

Policies  and  Plans 

This  refers  to  plans,  organizational  policies,  and  regulations  that  may  govern 
the  software  operations  of  affected  organizations.  These  require  general  compliance 
and  may  originate  from  the  government,  contractor,  subcontractor,  etc. 

Problem  Perspective 

The  degree  of  understanding  or  insight  modelers  have  about  the  problem. 
Requirement  Definition 

Provides  a statement  of  need  which  states  why  the  project  is  needed,  what 
will  be  the  purpose  of  the  project  and  justifies  the  approach  to  be  taken  in  this 
project. 


Resource  Constraints 


Refers  to  constraints  placed  upon  an  activity  due  to  availability  of 
manpower,  money,  time,  hardware/software  facilities,  etc.  These  are  working 
constraints  which  operate  within  the  upperbounds  set  by  contract  specifications. 

Simulation  Modeling  Process 

The  act  of  creating  an  abstract  representation  of  an  actual  or  proposed 
system,  implementing  the  representation  as  a mathematical-logical  model  and 
experimentally  manipulating  it  on  a digit  computer.  (PRIT75) 

Standards 


These  refer  to  existing  standards  which  may  require  technical  compliance. 
Technology  Information 

Refers  to  state-of-the-art  information  in  pertinent  technology  and  provides 
guidance  on  methodology  to  be  used,  feasibility  of  an  approach,  etc. 


GLOSSARY  FOR  DIAGRAM  AO 


Appropriate  Support  Personnel 

Personnel  who  interact  with  the  project  and  modelers  to  provide  insight, 
critique,  perform  auxilary  tasks,  etc.  They  are  secondary  drivers  of  the  process 
and  are  necessary  for  the  primary  drivers  to  perform  an  adequate  job. 

Modelers 


Personnel  with  experience  in  modeling.  They  are  primary  drivers  of 
the  process  and  provide  continuity  by  their  association  with  the  project 
throughout  its  lifetime. 

Perform  Design 

This  refers  to  refining  the  preliminary  model  until  the  desired  level 
of  relationship  with  the  actual  system  is  achieved.  A structured  walk-through 
provides  model  validation.  The  simulation  program  is  then  created  and 
validated  to  produce  a validated  simulation  model. 

Preliminary  Model 

An  initial  attempt  to  create  a model  which  results  in  determining  perfor- 
mance measures,  primary  entities,  primary  decisions  and  a model  type. 

The  process  also  provides  a refined  problem  perspective  by  providing  the 
personnel  with  a better  understanding  of  the  system  being  modeled. 

Problem  Formulation 


The  creation  of  a scope  which  provides  focus  for  the  defined  requirements. 
This  includes  problem  definition,  objectives  definition  and  a bounding  process. 
The  result  is  a bounded  problem  with  clear  objectives  and  a refined  problem 
perspective. 

Validated  Simulation  Model 


A simulation  model  which  has  been  tested  and  found  to  be  correct.  Validation 
at  this  point  can  include  replicability,  predictability  and  structural  validity. 
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Simulation  Program 

The  software  embodiment  of  the  validated  model  which  will  be  exercised 
on  a digital  computer. 

Validated  Model 

A model  which  has  been  "inspected"  and  found  to  be  correct.  Validation  at 
this  point  usually  consists  of  structural  walk-throughs  by  personnel 
familiar  with  the  system  being  modeled. 

Validated  Simulation  Model 

A simulation  model  which  has  been  tested  and  found  to  be  correct.  Validation 
at  this  point  can  include  replicability,  predictability  and  structural  validity. 
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GLOSSARY  FOR  DIAGRAM  A1 


Model  Boundaries 


The  implementation  of  the  separability  assumption.  We  define  the  system 
components  as  essentially  separate  from  their  surroundings  creating  a "boundary" 
between  our  system  and  its  surroundings.  This  assumes  that  most  of  the  inter- 
actions between  the  system  being  modeled  and  its  surroundings  can  be  ignored 
(separability  assumption). 

Problem  Definition 


A definition  of  the  problem  or  problems  which  the  project  will  address.  This 
is  similar  to  an  objective  but  does  not  necessarily  have  a goal.  An  objective 
is  a goal  which  is  established  from  a defined  problem. 

Problem  Objectives 

A goal  or  goals  toward  which  the  project  will  be  directed  based  upon  the 
problem  definition.  Essentially  provides  direction  to  the  project. 

Refined  Problem  Perspective 

Improved  problem  understanding  or  insight  derived  through  consta.it  work  or 
contact  with  the  problem. 
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Decide  Formal  Model  Type 

Once  a preliminary  model  is  created  it  can  be  used  in  conjunction  with  the 
refined  problem  perspective  to  decide  what  type  of  formal  model  is  appropriate 
to  obtain  the  project  objectives. 

Other  Model  Types 

Since  this  metamodel  is  only  concerned  with  simulation  models  we  consider 
any  other  formal  model  type  as  an  "other  model  type". 

Performance  Measures 


A metric  which  allows  a system/model 's  performance  to  be  quantified. 

Primary  Decisions 

Those  relevant  decisions  which  impact  the  primary  entities  and  are  primarily 
responsible  for  linking  input  to  output.  This  is  the  implementation  of  the 
causality  assumption  (KAR77) . 

Primary  Entities 

Those  entities  which  are  of  primary  interest.  Specifically,  the  items  which 
drive  the  system.  This  is  an  application  of  the  selectivity  assumption  (KAR77). 

Simulation  Model  Type 

The  formal  model  which  will  be  used  to  solve  for  the  objectives  is  simulation. 
By  simulation  we  are  focusing  on  stochastic  simulation  embodied  by  GPSS,  GASP, 
QGERT,  SIMSCRIPT,  etc. 
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Model  Alterations 


These  are  changes  necessitated  by  deficiencies  identified  by  the  analysis 
process  and  generally  reflect  insufficient  or  incorrect  statistical  data  types 
or  output.  However,  they  are  major  structural  modifications  identified  by 
the  analysis  which  must  be  made  in  order  to  reach  the  objectives. 

Model  Corrections 

These  are  changes  necessitated  by  deficiencies  identified  by  the  validation 
processes.  Usually  they  do  not  require  major  structural  changes,  but  reflect 
minor  inconsistencies  within  the  model. 

Qualitative  Data 

Data  which  is  not  numerical  in  nature.  This  may  include  descriptive, 
graphical,  visual,  etc. 

Refine  Model 


A refinement  of  the  preliminary  model  to  produce  a model  which  has  sufficient 
detail  and  structure  to  allow  the  objectives  to  be  reached.  Model  Refinement  is 
accomplished  within  the  structure  imposed  by  a choice  of  simulation  as  a formal 
model  type. 

Simulation  Model  Run  Time 

The  actual  computer  time  required  to  make  a "good"  run  on  the  computer  of 
the  simulation  model.  A good  run  is  defined  as  a computer  run  which  produced 
results  with  sufficient  user  confidence  to  be  considered  as  representative  of  the 
system  behavior. 

Simulation  Program 

The  software  embodiment  of  the  validated  model  which  will  be  exercised 
on  a digital  computer. 

Validated  Model 

A model  which  has  been  "inspected"  and  found  to  be  correct.  Validation  at 
this  point  usually  consists  of  structural  walk  throughs  by  personnel  familiar  with 
the  system  being  modeled. 

Validated  Simulation  Model 

A simulation  model  which  has  been  tested  and  found  to  be  correct.  Validation 
at  this  point  can  include  replicability,  predictability  and  structural  validity. 
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Completed  Documentation 

Fully  documented  models  and  analysis.  This  includes  models,  programs, 
approaches,  statistical  techniques,  results,  analysis  methods,  etc. 

Computer  Facilities 

This  refers  to  the  physical  facilities  available  to  exercise  the  simulation 
model  and  includes  both  hardware  and  software.  Software  attention  is  directed 
to  statistical  analysis.  Hardware  attention  is  directed  primarily  to  conjestion 
which  impacts  throughput  time. 

Computer  Money 

The  capital  available  to  pay  for  computer  time. 

Project  Funding 

1 The  level  of  project  funding  directly  impacts  the  detail  and  level  of  documentation 

Qualitative  Analysis 

The  analysis  of  the  physical  structure  of  the  system  as  embodied  in  the  model. 

This  is  done  in  conjunction  with  the  refined  problem  perspective  and  identifies 
flaws  in  the  inherent  system  structure,  understanding  of  the  system  synergism 
and  actual  or  potential  problems  which  may  be  incidental  to  the  actual  project 
objectives. 

Quantitative  Analysis 

The  analysis  required  to  insure  that  an  accurate  estimate  of  the  systems 
expected  behavior  is  obtained,  and  the  application  of  techniques  which  allow 
comparison  of  alternatives  when  the  performance  measures  are  random  variables. 

Qualitative  Data 

Data  which  is  not  numerical  in  nature.  This  may  include  descriptive, 
graphical,  visual,  etc. 
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Analyze  Needs 


GLOSSARY  FOR  DIAGRAM  A33 


An  analysis  of  the  software  needs  to  implement  the  model. 


Choose  Simulation  Language 

A simulation  language  is  chosen  whose  structure  will  most  readily 
accept  the  system  model  with  a minimum  of  alterations. 

Define  Requirements 

Define  the  requirements  for  the  model  in  terms  of  software  implementation 
based  upon  defined  software  needs. 

Corrections  of  the  Design 

Corrections  necessary  to  produce  a verified  program. 

Program  & Verify 

Creating  the  computer  program  which  is  the  software  implementation 
of  the  system  model  and  insuring  that  the  computer  program  is  correct. 

Requirements  Corrections 

Corrections  for  the  requirement  definitions  necessitated  because  they 
cannot  be  met  by  current  simulation  languages. 

Simulation  Languages 

Computer  languages  which  provide  a structure  and  terminology  to 
facilitate  the  building  of  simulations. 

Software  Design 

This  is  the  creation  of  detailed  design  and  includes  detailed 
program  specifications,  data  structures,  subprogram  structure,  etc. 
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Perform  Validation 


The  act  of  validating  the  simulation  program 
Validation  Needs 


Specific  validation  needs  which  have  been  identified 


Defined  requirements  for  the  simulation  program  validation. 

Validation  Strategy 

The  strategy  to  be  used  to  implement  the  requirements  of  the  simulation 
program  validation. 
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Predictively  Valid 

When  data  produced  from  the  simulation  program  accurately  predicts 
data  values  from  the  actual  system. 

Replicatively  Valid 

When  data  produced  from  the  simulation  program  matches  historical  data 
acquired  from  the  actual  system. 

Structurally  Valid 

Not  only  does  the  simulation  program  reproduce  the  real  system  behavior, 
but  it  structurally  reflects  the  way  in  which  the  real  system  operates  to  pro- 
duce this  behavior. 
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Analysis  of  the  GPS  Receiver  Loss-of-Lock  Problem 
William  L.  Brogan* 

Participant,  USAF-ASEE  Summer 
Faculty  Research  Program 

The  Global  Positioning  System  (GPS)  concept  involves  the  measure- 
ment of  range  to  four  satellites  whose  positions  are  accurately  known. 
This  allows  the  user  to  determine  his  own  position  with  high  accuracy. 
The  GPS  receiver  measures  these  ranges  indirectly  by  means  of  a mea- 
surement of  the  delay  between  transmission  and  receipt  of  coded  satel- 
lite signals.  Under  some  conditions  the  receiver  fails  to  track  the 
satellite  signals  properly,  that  is,  loses  lock.  It  is  not  always 
immediately  obvious  when  this  has  occurred. 

This  report  presents  four  general  methods  which  might  be  of  use  in 
detecting  loss-of-lock: 

(1)  A runs  test  for  testing  for  randomness  of  various 
tracking  errors. 

(2)  Sequential  maximum  likelihood  ratio  test  for  detection 
of  a valid  range  signal. 

(3)  A non-sequential  maximum  likelihood  test. 

(4)  A Chi-square  test  on  signal  variances. 

All  of  these  methods  derive  from  statistical  hypothesis  testing,  and  are 
intended  to  apply  either  to  Kalman  filter  residual  signals  or  to  re- 
ceiver tracking  loop  errors. 

/ 

Some  of  the  suggested  methods  have  been  illustrated,  using  hypo- 
thetical data.  In  addition,  some  preliminary  tests  have  been  made  on 
the  delay  lock  loop  error  and  on  some  Kalman  filter  residuals  from  a 
simulated  test  of  the  receiver.  Based  on  these  few  results,  no  definite 
conclusions  are  drawn.  The  methods  still  seem  promising,  and  ought  to 
be  given  a fuller  evaluation  when  better  data  becomes  available.  The 
methodology  that  would  be  useful  for  further  evaluation  has  been  pre- 
sented in  this  report. 


*Professor,  University  of  Nebraska 
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1.0  Introduction  and  Background 

The  Global  Positioning  Satellite  System  (GPS)  provides  a means  for 
very  a 'curate  position  determination  by  a wide  class  of  users.  The 
basic  idea  is  to  obtain  measurements  of  the  range  from  the  user  to 
several  satellites  whose  positions  are  accurately  known.  In  principle, 
simultaneous  range  measurements  to  three  properly  dispersed  satellites 
provide  a position  fix. 

The  range  measurements  are  actually  mechanized  as  measurements  of 
signal  one-way  transit  times.  This  involves  the  satellite  clock  and  the 
user  clock,  and  an  unknown  bias  exists  between  them.  The  measurements 
of  range  are  corrupted  by  this  time  bias;  as  a result,  they  are  referred 
to  as  pseudo-range  measurements.  Ideally,  four  pseudo-range  measure- 
ments would  allow  solution  for  the  three  unknown  position  components 
plus  the  time  bias. 

In  reality,  numerous  system  errors  and  signal  path  distortion 
factors  prevent  the  ideal  single-fix  (four  measurements)  situation  from 
giving  satisfactory  navigation  results.  Therefore,  a recursive  filter- 
ing solution  (Kalman  filter)  is  used.  For  a dynamic,  maneuvering  user 
such  as  an  aircraft,  the  result  is  a so-called  integrated  navigation 
system.  The  name  derives  from  the  fact  that  GPS  signals  are  integrated 
with  another  navigation  system,  usually  an  inertial  navigator. 

In  an  integrated  navigation  system,  the  two  distinct  sources  of 
navigation  data  are  blended  together  by  means  of  the  Kalman  filter  to 
obtain  the  optimal  estimate  of  the  system's  state  at  any  given  time. 
Simply  stated,  the  advantages  of  such  a system  are  that  the  GPS  fixes 
can  effectively  limit  the  long-term  navigation  error  growth  which  is 
characteristic  of  inertial  systems.  Also,  the  inertial  system  can  be 
used  to  velocity-aid  the  GPS  receiver.  This  means  that  the  apparent 
shift  of  the  satellite  transmitted  carrier  frequency  due  to  doppler 
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effects  can  be  greatly  reduced  or  compensated  for.  This,  in  turn, 
permits  use  of  much  narrower  bandwidth  user  receivers;  hence,  the  system 
is  less  susceptible  to  jamming*.  An  earlier  study  of  such  a system  is 
found  in  Ref.  [1], 

The  principle  interest  in  this  report  is  the  loss-of-lock  problem. 
The  GPS  receiver  under  consideration  is  the  Generalized  Development 
Model  (GDM) . In  simplest  terms,  it  consists  of  two  error-nulling  track- 
ing loops.  The  carrier  tracking  loop  is  basically  a phase  locked  loop 
and  the  code  loop  is  basically  a delay  locked  loop.  The  errors  in  this 
second  loop  are  errors  in  the  one-way  signal  transit  time  and,  hence, 
are  proportional  to  errors  in  pseudo-range.  Because  of  the  large  amount 
of  sophisticated  signal  processing  (correlation  techniques  between  the 
received  code  and  a locally  generated  code) , it  is  not  always  obvious 
when  the  receiver  is  not  tracking  a valid  signal.  This  loss-of-lock 
phenomenon  could  occur  because  of  jamming  or  for  other  reasons.  This 
report  presents  several  possible  approaches  to  the  problem  of  early 
recognition  of  loss-of-lock  and  other  matters  related  to  the  recovery 
from  this  situation. 

2.0  System  Description  and  Preliminary  Considerations  Regarding  Loss- 
of-Lock. 

2.1  General  System  Description. 

A simple  functional  description  of  the  GPS  system  is  given  in 
Figure  1. 


*Reduced  sensitivity  to  jamming  is  also  due,  in  part,  to  the  highly 
directional  narrow  beam  antennas  and  to  the  spread  spectrum  tech- 
niques used  in  the  receiver. 


Figure  1:  GPS  INTEGRATED  NAVIGATION  SYSTEM 


For  a detailed  description  of  the  receiver  operation.  Ref.  [2  and 
3]  should  be  consulted.  The  radio  frequency  (RF)  signals  input  to  the 
user  antenna  are  pseudo-random  noise  code,  modulated  by  high  frequency 
carriers  (£*1.57  GHz  or  1.24  GHz).  When  the  carrier  loop  is  functioning 
properly,  a phase  coherent  locally  generated  signal  is  used  to  demodu- 
late the  received  signal.  This  is  the  coherent  mode.  The  carrier  loop 
can  lose  lock  because  of  jamming  or  excessively  high  dynamic  maneuvers. 
When  this  happens,  system  operation  can  still  continue  in  a non-coherent 
mode.  In  this  mode,  the  delay  lock  loop  continues  to  operate,  giving 
indications  of  pseudo-range.  The  carrier  frequency  is  computed  using 
IMU  data  (rather  than  being  tracked  with  the  carrier  loop) . 

The  problem  of  interest  here  concerns  the  non-coherent  mode,  and  in 


particular,  the  loss-of-lock  of  the  delay  lock  loop,  i.e.,  the  code 

loop.  The  signals  r of  Figure  1 are  measured  pseudo-ranges  to  the  il 

Pi- 

satellite.  When  a given  channel  loses  lock,  the  r ^ signal  does  not 

normally  dissappear  or  show  an  obvious,  abrupt  change.  Rather,  r . 

pi 

slowly  diverges  from  the  correct  value,  in  a random  walk  fashion.  The 


early  recognition  of  the  phenomenon  Is  important,  because  if  a bad 
channel  continues  to  be  used  as  valid  data,  the  total  navigation  esti- 
mate x can  deteriorate.  Furthermore,  since  portions  of  the  x.  vector 
contain  estimates  of  IMU  errors  (platform  tilt,  etc.)  the  dashed  feed- 
back line  of  Figure  1 can  cause  the  inertial  system  to  be  degraded. 


2.2  A Preliminary  Suggestion. 

The  first  suggestion  made  as  a result  of  this  study  was  to  disable 
the  feedback  to  the  IMU  (at  least  the  level  channel)  at  the  time  of 
down-moding  from  the  coherent  to  the  non-coherent  mode.  This  sugges- 
tion, verbally  conveyed  to  Collins  Radio  during  a June  22-23,  1978,  trip 
to  Cedar  Rapids,  will  be  analyzed  in  more  detail  in  later  sections.  The 
rationale  is  that  (1)  the  system  is  aware  when  down  moding  occurs,  (2) 
down  moding  is  probably  due  to  jamming;  hence,  it  raises  a flag  regard- 
ing potential  difficulties,  (3)  by  disabling  the  feedback  to  the  IMU, 
its  integrity  is  preserved  so  that  the  mission  can  still  be  carried  out. 
The  usual  growth  of  error  in  inertial  systems  will  commence  at  this 
point  in  time,  but  is  rather  slow,  especially  for  an  accurately  ini- 
tialized system.  The  duration  of  the  jamming  threat  should  be  short 
compared  with  the  classical  Schuler  period,  (4)  the  Kalman  filter 
continues  to  update  all  error  states  and  incorporates  them  into  x for 
use  in  receiver  aiding  and  other  navigation-dependent  mission  functions. 
This  would  continue  to  be  true  until  it  is  definitely  decided  that  loss- 
of-lock  has  occurred.  At  that  time,  measurements  from  that  channel  are 
no  longer  processed  by  the  Kalman  filter.  Even  if  all  channels  lose 
lock,  the  system  could  continue  in  an  all-inertial  mode. 

2.3  The  Present  Scheme. 

At  the  present  time,  a reasonableness  check  is  made  on  each  measure- 
ment residual  before  it  is  processed  by  the  Kalman  filter.  If  the 
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magnitude  of  the  residual  exceeds  six  sigma,  as  computed  from  the  filter 
covariance,  that  measurement  is  rejected.  This  procedure  doesn't  always 
give  satisfactory  performance.  The  six-sigma  threshold  is  at  times  too 
large.  (Actually,  the  six  factor  is  what  is  too  large  in  my  opinion.) 

In  some  tests,  especially  single  channel  tests,  loss-of-lock  can  occur 
while  the  measurement  residuals  remain  sufficiently  small  to  preclude 
detection  by  a six-sigma  threshold.  This  could  be  due  to  the  filter 
wrongly  adjusting  certain  states  to  force  the  residuals  to  remain  small. 
(This  would  be  less  likely  in  a full  four  channel  configuration.)  It 
could  also  be  due  to  the  very  slow  rate  at  which  the  apparent  pseudo- 
range measurements  diverge  from  truth  after  loss-of-lock. 

The  six-sigma  threshold  is  also  too  small  in  some  cases.  If  a 
channel  is  once  rejected,  the  remaining  three  channels  adjust  the  state 
estimates  and  lead  to  an  overly  optimistic  estimate  of  the  state  un- 
certainties. The  "sigma"  part  of  six  sigma  becomes  so  small  that  when 
valid  measurements  from  the  fourth  channel  become  available,  they  are 
consistently  rejected. 

2.4  Some  Possible  Approaches  to  Detecting  Loss-of-Lock 

There  are  a number  of  ways  that  might  be  considered  as  a means  of 
detecting  loss-of-lock.  They  differ  in  the  approaches,  assumptions  made 
and  signals  used.  Four  approaches  mentioned  in  Ref.  [4]  are: 

(1)  A priori  signal  strength  estimator.  A measurement  of  the  RF- 
signal  strength  would  be  used,  along  with  estimated  antenna  gain  and  the 
output  of  the  automatic  gain  control  circuitry  (AGC) . The  jamming  power 
to  signal  power  ratio  can  be  estimated.  If  this  ratio  exceeds  known 
limits,  it  is  quite  certain  that  loss-of-lock  has  occurred  or  soon  will. 
This  RF  method  will  not  be  pursued  further  in  this  study.  This  does  not 
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constitute  a value  judgement  of  the  method.  In  fact,  it  could  prove  to 
be  a simple  and  r<  liable  method.  Also,  it  could  potentially  indicate 
when  jamming  has  ceased  and  when  reacquisition  should  be  attempted. 


(2)  Position  Variance  Monitor.  This  method  is  similar  to  the 
present  scheme,  except  it  could  be  applied  to  some  other  signal  (delay 
lock  loop  error,  for  example)  and  the  method  of  determing  the  acceptable 
variance  might  differ. 

I 

. J 

(3)  Signature  Detection.  This  refers  to  a generic  class  of 
methods  which  depend  on  some  characteristic  behavior  of  signals  in  the 
delay  lock  loop  during  loss-of-lock.  The  methods  and  characteristics 
are  "as  yet  unknown". 

(4)  Signal  Presence  Monitor.  This  title  refers  to  classical 
detection  theoretic  methods. 

Concurrently  and  independently  of  Ref.  [4],  several  methods  of 
approaching  the  loss-of-lock  detection  problem  have  been  considered  in 
this  study.  These  methods,  to  be  presented,  may  all  fit  one  or  the 
other  of  the  above  categories,  but  the  correct  assignment  to  a given 
category  is  not  always  obvious. 

All  of  the  following  approaches  grow  out  of  statistical  hypothesis 
testing  in  one  form  or  another.  Some  of  the  relevant  considerations 
are:  What  signals  should  be  used?  Are  they  easily  accessible?  What 
are  their  data  rates?  What  levels  of  confidence  are  desired  on  the 
decisions?  What  are  the  implementation  costs  of  a given  scheme?  During 
jamming,  receiver  bandwidths  are  narrowed,  giving  long  response  times. 

As  a result,  measurement  noise  at  five-second  intervals  is  clearly  not 
uncorrelated.  This  means  that  there  are  system  states  that  are  being 
ignored.  What  is  the  effect  of  these  unestimated  states? 
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By  choice,  consideration  in  the  study  is  limited  to  receiver  output 
signals  (available  once  per  second)  or  to  Kalman  filter  signals  (avail- 
able once  per  five  seconds.)  Some  of  the  tests  suggested  would  apply 
also  to  the  receiver's  delay  lock  loop  tracking  error  (available  every 
20  milliseconds),  but  it  seems  more  difficult  to  get  at  signals  such  as 
this,  buried  inside  the  receiver.  The  questions  of  data  rates  and 
levels  of  confidence  are  intimately  tied  to  the  question  of  how  long  one 
can  afford  to  wait  before  detecting  a loss-of-lock.  The  receiver's 
output  measurements,  at  a one-per-second  rate,  should  be  able  to  detect 
loss-of-lock  much  earlier  than  the  Kalman  signals,  for  a given  level  of 
confidence.  Also,  it  intuitively  seems  wasteful  to  throw  away  four 
receiver  measurements  out  of  every  five  as  is  now  the  case. 

Another  approach  to  the  validation  of  range  measurements,  which  has 
been  tested  on  the  tracking  of  deep  space  shots  [5]  is  to  assume  that 
range  residuals  form  a polynominal  in  time.  A Kalman  filter  is  used  to 
estimate  the  coefficients  of  the  polynominal,  and  to  provide  an  estimate 
of  the  residual  variance.  This  variance  is  then  used  in  a magnitude 
test  similar  to  (2)  above  and  Section  4.2  to  follow.  Although  of 
general  interest,  this  method  does  not  seem  to  be  a reasonable  approach 
in  the  present  application.  It  seems  doubtful  that  a new  Kalman  filter 
is  desirable  to  validate  inputs  to  the  original  Kalman  filter.  If  the 
computer  space  were  available  for  extra  states,  some  of  the  ideas  [5] 
could  be  incorporated  into  the  filter,  and  the  problem  of  neglected 
measurement  noise  correlations  could  be  corrected  at  the  same  time. 

This  would  be  done  by  providing  a better  model  of  range  residual  be- 
havior. 

3.0  A More  Detailed  Consideration  of  the  Kalman  Filter 

When  the  recommendation  of  Section  2.2  is  implemented,  a more 
detailed  representation  of  the  Kalman  filter  portion  of  Figure  1 can  be 
represented  as  in  Figure  2.  That  is,  the  dashed  feedback  signal  path  is 
now  open  so  that  there  is  no  control  input  to  the  IMU. 
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FIGURE  2:  BLOCK  DIAGRAM  OF  THE  KALMAN  FILTER 


The  input  _z  represents  the  set  of  four  pseudo-range  measurements  from 
the  receiver,  corrupted  by  additive  measurement  noise  v.  When  loss-of- 
lock  occurs  in  the  code  loop,  the  receiver  does  not  suddenly  shift  its 
output  from  _z  = r^  + v to  just  noise  v.  Rather,  because  of  the  way  the 
code  loop  discriminator  works,  the  output  gradually  drifts  off  from  its 
value  before  losing  lock.  This  can  be  modeled  as  = w + v where  w is  a 
noise  or  error  term  which  typically  would  diverge  from  r^  after  loss  of 
lock.  It  can  probably  be  modeled  as  a random  walk. 

Simulations  and  lab  tests  have  shown  that,  at  least  in  the  single 
channel  case,  when  code  l-oop  loss-of-lock  occurs,  will  still  track  z_ 
fairly  tightly,  i.e.,  the  residual  is  maintained  at  a small  magnitude. 
This  is  caused  by  the  "corrections"  being  applied  to  the  IMU  (in  the 
usual  case  of  full  feedback  control)  causing  the  IMU  parameters  and/or 
clock  parameters  to  take  on  whatever  wrong  values  are  necessary  to  keep 
the  measurement  residuals  small.  This  means  that  pseudo-range  residuals 
may  not  be  good  signals  for  use  in  detecting  loss-of-lock.  What  is 
needed  is  an  independent  source  of  pseudo-range  estimates  which  can  be 
compared  with  receiver  outputs.  The  IMU/clock  system  can  provide  these 
independent  estimates,  provided  that  GPS  signals  are  not  used  to  re- 
calibrate the  IMU.  This  decoupled  concept,  as  shown  in  Figure  2,  is  the 
suggestion  made  in  Section  2.2.  It  applies  whenever  the  receiver  goes 
from  the  coherent  to  the  non-coherent  mode.  No  significant  reduction  in 
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navigation  accuracy  would  be  expected  because  of  the  opened  feedback 

path.  The  error  states  6X  continue  to  be  estimated  (from  GPS  signals) 

and  used  in  the  whole  value  state  estimates  X «■  x + 6x  (feed  for- 

— — nav.  — 

ward  control).  This  situation  would  prevail  until  it  is  clearly  estab- 
lished that  loss-of-lock  has  occurred.  Thereafter,  the  pure  inertial 
system  output  x would  be  used.  The  remainder  of  this  report  examines 
the  implications  of  the  scheme  just  proposed. 

3.1  Definition  and  Properties  of  Signals  in 
(and  Related  to)  Fig.  2. 

Referring  to  Fig.  2,  the  following  defini- 
tions and  relationships  are  states  without 
comment.  These  will  be  useful  in  the  sequel. 


xn(t) 


x(t) 


x - True  State 

x * IMU  estimate  of  x 
— n — 

6x  = Error  in  x , i.  e. , 6x  = x - x or  x = x + 6x 
n — — — n — — n — 

<5x  ■ Filter  estimate  of  6x;  6X  * X - x 
— — — — n 

X » x + 5X  = est.  of  total  state 
— — n — 

5X  - error  in  6X  = 6x  - <5X 

X " error  in  X - x - X - x + 6x  - (x  + 6X)  * 6x  - 6X 
— — — — n — — n — — — 

6x  = X 

z ■ actual  measurement.  Although  there  may  be  a vector  set  of  measure- 
ments, they  are  processed  one  at  a time.  Therefore,  all  measurement 
quantities  may  be  treated  as  scalars. 
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z - h(x)  + v = h(x_  + 6x)  + v » h(x  ) + H6x  + v when  locked  on 
n — — n — — 

z =•  w + v when  not  locked  on 

v = zero  mean  random  meas.  noise 

w - random  walk  variable;  initial  value  is  range  at  time  of  loss-of- 
lock 

z * predicted  meas.  based  only  on  IMU  = h(x  ) 
n — n 

2 » predicted  meas.  based  on 

= h(x  + 5Jk)  - h(x  ) + Hd£  - z 
n n 

2 = z - 2 » meas.  residual  * h(x 
- H(<5x  - &St)  + v = H 

Az  » z - z - h(x  ) + H6x  + v - 
n n.  — 

* H6x  + v = 2 + H6£ 

2=w  + v-  2*w  + v-  z - H5« 

— — — n 

Az  =■  w + v - z *2  + HiSJfc 


+ GPS  =«  h(*) 
+ H6* 


) + H6x  + v - h(x  ) - H6£ 
n n 

X + v 


h(*n) 


when 
/ locked 


on 


when  not 
locked  on 


3*2  Statistics  of  Signals:  Making  use  of  the  relations  in  Section  3.1, 
the  means  and  covariances  of  the  principal  signals  of  Figure  2 are  as 
follows: 

E(6£)  =*  E&  - Ex  =*  x - x =6x 
— — n — — n — 

cov  (<5A)  * P (Kalman  covariance) 
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E(fix)  = fix  (This  means  that  IMU  errors  are  treated  as  deterministic) 

E (X)  = x + E(5X)  = x + fix  31  x 
n n 

cov(X)  =■  E {(x  - X)  (x  - X)T}  * cov  (X)  = cov  (6X)  = cov  (5X)  * P 
E(X)  - 0 

E(2)  ■ HEX  + Ev  = 0 when  locked  on 

E(2)  - E(w)  - E(z  + H5X) 

— n 

- E(w)  - h (x) 

* h(x  ) - h(x) 

— o 

x * state  (true)  at  time  lock  is  lost 
— o 

E(Az)  - E2  + Hfix  in  both  cases,  but  2 differs. 

cov  (2)  * HPHT  + R when  locked  on 
T 2 

* HPH  + R + a„  (t,  - t ) when  not  locked  on 
n ' k o 

T 

cov  (Az)  - cov  (2)  + HPH  + cross  covariance  terms  (closer  analysis 
follows).  When  locked  on 

Az  - E(Az)  * Hfix  + v - HE(5x)  - E(v)  * v - T (if  fix  is  treated  as  de- 
terministic so  that  Efix  = fix) 

cov  (Az)  - R,  if  fix  is  treated  as  deterministic  error.  (Actually  this 
T 

term  is  R + HP  H , where  P is  the  cov.  of  IMU  errors  fix.)  The 
nav  nav  — 

simpler  result  is  consistent  with  many  other  steps  which  alsc  assumes  fix 
is  a deterministic  error  growth.  The  same  result  follows  when  the 
Az  ■ 2 + HfiX  form  is  used,  but  one  must  note  and  use  E5X  * fix  and 
6X  - fix  * -x  and  use  the  cross  covariances  between  2 and  X. 


When  not  locked  on 


Az  - E(Az)  = w + v - z - (w  + v - z ) 
— — — n — — — n 

= (w  - w)  + ( V - v) 


2 

So  cov  (Az)  = cr  (t  - t ) + R 
n o 

Recap : 

When  locked-on:  E(Az)  * H6x 

(H^: ) cov  (Az)  = R 

When  not  locked  on:  E(Az)  » h (x^)  - h(x(t))  + HSx 

(H  :)  a r(t-t  ) + H<$x 

o 2° 

cov  (Az)  = R + a (t-t) 
n o 

Therefore,  the  differences  between  being  locked  on  and  not  being 

locked  on  are  linear  functions  of  time  for  both  signals  p = E(Az)  and 
2 

a * cov  (Az)  (not  the  same  linear  functions,  however). 

Because  of  the  linear  growing  differences  between  Hq  and  H^,  a hypo- 
thesis test  on  variance  and/or  mean  is  suggested  as  a possible  method  to 
determine  loss-of-lock. 


yes , locked 
on 


lost  lock 


One  such  approach  is  presented  in  Appendix  D. 
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Several  potential  methods  of  detecting  loss-of-lock  are  suggested. 
They  vary  in  the  approach  used,  the  assumptions  made  and  the  signals  to 
be  tested. 

4.1  Tests  For  Randomness. 

Theoretically,  the  measurement  residual  z of  a properly  functioning 
Kalman  filter  is  supposed  to  be  a zero  mean,  white  random  sequence  with 
covariance  equal  to 

cov  [2(k+l|k)]  - HP(k+l|k)HT  + R 

The  "Runs  Test"  of  Appendix  A can  be  applied  to  the  measurement  resi- 
duals to  determine  whether  they  are  random  or  not.  When  the  receiver  is 
locked  on  so  that  valid  range  measurements  are  being  received,  2 should 
be  random.  If  the  test  indicates  some  non-random  deterministic  trend  in 
the  2 then  this  would  be  attributed  to  loss-of-lock.  Note  that  other 
modeling  errors  or  filter  simplifications  might  also  cause  2 residuals 
to  exhibit  non-random  behavior.  Also,  note  that  when  the  receiver 
bandwidth  is  narrow  to  guard  against  jamming,  slow  receiver  dynamic 
response  times  will  cause  a deterministic  transient  component  to  exist 
in  the  2 residuals.  Whether  or  not  this  would  invalidate  the  runs  test 
conclusions  remains  to  be  evaluated. 

In  principle,  the  runs  test  could  probably  also  be  applied  to  the 
delay  lock  loop  tracking  error  as  well.  This  statement  is  based  upon 
the  supposition  that  the  error  in  a properly  functioning  tracking  loop 
will  be  randomly  fluctuating  about  zero. 

The  advantage  of  a runs  test  is  its  extreme  simplicity.  It  does 
not  require  knowledge  of  the  signal  variance.  However,  a fairly  large 
number  of  samples  (25  to  30  or  more)  must  be  tested  in  order  that  the 
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asymptotic  gaussian  properties  be  valid.  These  facts  are  what  prompt 
the  suggestion  of  using  the  delay  lock  loop  tracking  error  signal,  which 
is  available  at  a much  higher  rate,  and  whose  variance  is  probably 
unknown. 

4.2  Simple  Tests  of  Magnitude 

When  the  theoretical  values  for  signal  variances  are  available, 
reasonableness  tests  on  the  magnitude  can  be  made.  This  is  currently 
done  on  the  measurement  residuals  using  a six-sigma  threshold.  Sigma  is 
determined  from  the  Kalman  filter  covariance  results.  Something  of  this 
sort  may  be  combined  with  a runs  test,  as  suggested  at  the  end  of 
Appendix  A.  However,  a single  sample  test  on  each  individual  residual 
is  not  as  statistically  meaningful  as  a test  based  on  a set  of  succes- 
sive samples.  Some  tests  which  depend  on  a sequence  of  samples  are 
given  in  the  following  sections. 

Before  concluding  this  section  on  simple  magnitude  tests,  it  is 
pointed  out  that  once  loss-of-lock  has  occurred,  the  variance  of  the 

measurement  residual  grows  in  a linear  fashion  with  time  (see  Section 

2 

3.2).  The  slope  of  the  growth  is  the  random  walk  variance  an  , which 
would  have  to  be  estimated  from  tests  of  the  loss-of-lock  behavior.  If 
this  opening  of  the  threshold  is  properly  incorporated,  then  the  rejec- 
tion of  valid  pseudo-range  data  at  later  times,  as  discussed  in  Section 
2.3,  should  be  avoidable. 

4.3  Tests  For  Distribution  Parameters 

The  form  of  the  pseudo-range  signal,  and  its  statistical  charac- 
teristics, are  known  when  the  receiver  is  locked-on.  By  hypothesizing  a 
random  walk  model  for  the  receiver  output  after  loss-of-lock,  a second 
set  of  possible  signal  characteristics  are  derived.  Appendix  B gives 


details  of  a sequential  test  on  receiver  output  signals.  Two  thresholds 
are  specified,  depending  on  probability  of  miss  and  probability  of  false 
alarm.  Then,  each  time  a receiver  signal  is  tested,  three  possible 
decisions  exists;  (1)  receiver  is  locked  on;  (2)  receiver  had  lost  lock; 
(3)  no  definite  decision  can  be  made  yet,  so  additional  data  should 
continue  to  be  processed. 

This  test  was  originally  set  up  to  be  applied  to  receiver  outputs 
z,  rather  than  measurement  residuals  z (or  some  other  signal  like  Az). 
The  reason  was  that  z is  available  every  second,  whereas  z is  only 
available  every  five  seconds.  However,  when  the  details  of  Appendix  B 
are  examined,  it  is  seen  that  the  propagated  covariance  matrix  P(k+l|k) 
and  the  linearized  measurement  matrix,  as  well  as  the  residuals,  are 
really  required  at  each  signal  test  time.  Therefore,  although  the 
complete  Kalman  update  is  not  required  every  second,  readout  of  several 
key  Kalman  filter  quantities  would  be  required. 

The  advantage  of  this  sequential  procedure  is  that  it  is  able  to 
detect  trends  that  are  developing,  since  it  works  on  a growing  sequence 
of  data.  The  major  disadvantage  is  that,  as  formulated  here,  the  test 
is  set  up  to  decide  whether  loss-of-lock  has  occurred  at  a known,  speci- 
fied time.  A non-sequential  simplified  version  of  the  same  test  is 
described  in  Appendix  C.  The  intention  is  to  test,  and  make  a decision 
about  loss-of-lock,  every  five  seconds.  This  is  the  Kalman  filter  cycle 
time.  Data  used  in  the  decision  would  be  the  five  receiver  outputs  (one 
per  second)  since  the  last  Kalman  update.  The  performances  of  these 
schemes  when  only  five  signals  are  used  in  the  decision,  will  have  to  be 
evaluated.  This  is  especially  true  in  view  of  the  fact  that  these 
signals  will  be  time-correlated. 

Exactly  analogous  procedures  could  be  developed  and  applied  to 
other  signals,  such  as  Az,  by  modifying  the  appropriate  expressions  for 
means  and  covariances.  Similar  procedures  have  recently  been  proposed 
[6]  for  target  detection,  l.e.,  discrimination  between  warheads  and 
decoys. 


4.4  Test  on  the  Variance  of  the  Az  Signal 


It  has  been  suggested  in  Section  2.2  that  the  integrity  of  an  in- 
dependent IMU-derived  estimate  be  maintained  so  that  GPS  signals  can  be 
compared  with  it.  The  difference,  called  Az,  is  defined  in  Section  3.1 
and  the  mean  and  covariance  of  this  signal  are  given  in  Section  3.2.  It 
is  shown  there  that  the  difference  between  the  covariance  of  a valid 
Az  (receiver  locked-on)  and  a bad  Az  (loss-of-lock  has  occurred)  is  a 
linear,  growing  function  of  time.  Such  a clear  divergence  of  signal 
characteristics  should  be  easily  detectible.  A Chi-Square  test  is 
presented  in  Appendix  D for  this  purpose.  To  allow  earliest  possible 
decisions,  it  is  suggested  that  the  sample  variance  in  Az  be  computed 
for  five  signals  over  a five  second  period.  This  sample  variance  is 
then  compared  with  a threshold  which  is  a constant  (which  depends  on  the 
desired  confidence  level)  times  the  pseudo-range  variance.  This  test  is 
almost  as  simple  as  the  existing  magnitude  reasonableness  test,  and 
should  be  investigated  for  its  effectiveness.  Perhaps  a combination  of 
this  type  of  test  and  the  runs  test  of  Section  4.1  will  prove  to  be 
simple,  timely  and  effective. 
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5 . 0 Results 


Some  preliminary  numerical  results  have  been  obtained,  using  two 
sources  of  data.  First,  the  delay  lock  loop  error  is  considered,  and 
then  the  Kalman  filter  residuals. 

5.1  A Preliminary  Look  at  Delay  Lock  Loop  Errors 

The  delay  lock  loop  error  signal  was  obtained,  in  the  form  of  strip 

chart  recordings,  for  several  cases.  Four  sets  of  error  signals  were 

obtained  by  manually  sampling  some  of  these  curves.  The  resolution  was 

not  good  and  sampling  quantization  errors  are  probably  large  as  a 

result.  In  the  first  two  cases,  the  signal-to-noise  ratio  was 

C/N  ■ 15.7  db-Hz.  The  system  was  in  the  non-coherent  mode,  using  the 
o 

P-code,  and  there  was  no  jamming.  Error  samples  were  read  off  the 
curves  at  3 second  intervals  (the  finest  subdivision  on  the  plot  paper) , 
and  the  errors  were  read  to  the  nearest  meter.  It  is  known  that  in  both 
these  first  cases,  loss-of-lock  did  not  occur. 

When  the  runs  test  described  in  Appendix  A was  applied,  the 
following  results  were  obtained. 

Sample  Mean  Sample  Mean 

Not  Subtracted  Subtracted 

No.  of  No.  of  No.  of 

CASE  Samples  Runs  Test  Statistic  Runs  Test  Statistic 

1 25  6 -2.90  9 -.727 

2 22  3 -2.42  8 -.764 

It  is  seen  that  when  the  sample  mean  s is  not  subtracted  off,  a smaller 
number  of  runs  is  obtained,  and  the  hypothesis  Hq,  (the  samples  are  not 
from  a zero  mean  random  sequence)  is  selected  at  the  .05  level  in  both 
cases.  The  means  were  s ■ -.74m  and  3.77m,  respectively.  In  the  first 
case,  enough  samples  were  near  enough  to  zero  to  cause  the  shift  in  the 
number  of  runs,  even  for  a rather  small  value  of  s.  In  both  cases, 
subtracting  out  the  sample  mean  before  applying  the  runs  test  caused  the 
hypothesis  to  be  selected. 
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When  the  Chi-square  test  of  Appendix  D was  applied  to  the  same  two 

2 2 

sets  of  data,  the  sample  variances  were  determined  to  be  S = 14.815m 
2 

and  25.0m  , respectively.  The  value  to  be  used  for  the  parent  popu- 

2 

lation  variances  is  not  certain.  Collins  uses  R =»  25m  in  the  coherent 

2 

mode  and  increases  that  to  75m  in  the  non-coherent  mode.  Even  when  the 

smaller  value  is  used,  the  critical  value  of  = 45  is  far  larger  than 

2 W 
S so  hypothesis  (valid  tracking  data)  would  be  selected  in  both 

cases. 

Cases  3 and  4 were  again  non-coherent  P-code  cases,  each  with 
C/Nq  * 8 and  with  an  accelerating  user.  In  both  cases  it  is  known  that 
lock  was  lost  (delay  error  larger  than  1 1/2  chips).  Because  of  the 
different  strip  chart  time  scale,  samples  were  taken  only  every  6 sec- 
onds. Results  from  these  two  cases  are  summarized  below: 


No.  of  Samples 
mean  s 

2 

variance  S 
no.  of  runs  without 
subtracting  of  mean 
no.  of  runs  after  sub- 
tracting off  the  means 


Case  3 
21 

3.84 

53.65 

3 

3 

11.29 

2.185 


Test  Statistic  with 
mean  subtracted  -3.794 


Case  4 
33 

-4.23 

36.0 

4 

4 

17.36 

2.80 


-4.766 


In  these  two  cases  the  number  of  runs  is  not  changed  by  subtracting  out 
the  mean  (but  the  values  for  n^  and  n^  were).  In  both  cases  3 and  4, 
the  test  statistic  z is  so  large  as  to  make  it  exceedingly  unlikely  that 
these  samples  came  from  a random  population.  Thus,  Hq  is  accepted. 

This  is  the  correct  decision. 
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With  regard  to  the  Chi-square  test,  the  critical  threshold  at  the 

2 

1%  level  is  1.79R.  This  is  to  be  compared  with  S in  order  to  select 

2 2 2 

either  H or  L.  The  values  S = 53.65m  and  36.0m  were  found  for 

° 1 2 
cases  3 and  4.  Naturally,  the  value  used  for  R is  crucial.  If  R“75m 

is  used,  then  the  result  is  the  selection  of  H.  (a  wrong  decision).  If 

2 1 
R*25m  is  used,  the  correct  decision  H is  made  in  case  3.  In  case  4, 

o 

the  Chi-square  test  does  not  select  the  correct  hypothesis  Hq.  It  is 

noted  that  the  current  6-sigma  reasonableness  test  would  not  have  thrown 

2 

out  any  of  the  data  points  in  case  3 and  4 (even  assuming  R*25m  is 
used) . 

Although  intended  for  use  with  Kalman  filter  residuals,  the  maximum 
likelihood  ratio  tests  of  Appendices  B and  C can  be  applied  to  the  delay 
error  if  a few  assumptions  are  made  to  fill  in  for  some  lacking  infor- 
mation. The  delay  error  samples  are  used  as  if  they  are  measurement 

residuals  2(k)=z(k)-HR(k|k-l)  of  Eq  (B-13) . The  term  z(k)-Hx(0 | -1)  can 

• T 2 

be  approximately  by  z(o)+zAt.  A value  for  HPH  + R of  50m  is  used. 

2 

Finally,  values  for  a and  z must  be  assumed,  as  well  as  values  for  Pw 

n M 

and  Pp. 

• _ 

In  cases  1 and  2,  with  no  user  motion,  a value  of  z“0  was  used.  If 

P *P  “.01,  then  the  two  decision  thresholds  are  -4.595  and  4.595.  If 
Hr  2 2 

Pm“Pf*.05,  they  reduce  to  -2.944  and  2.944.  When  an  “.3(m/sec)  , both 

cases  1 and  2 gave  a value  of  A which  exceeded  the  upper  threshold  of 

2.944  but  not  4.595.  This  means  that  decision  H1  is  reached  at  the  .05 

1 2 

level,  but  no  decision  was  reached  at  the  .01  level.  Increasing  a or 

T n 

decreasing  HPH  +R  caused  the  correct  decision  to  be  made  with  greater 

certainty  after  fewer  samples.  When  the  same  procedure  was  applied  to 

cases  3 and  4,  results  were  very  sensitive  to  the  assumed  value  for  z as 

well  as  cr  ^ and  HPHT+R. 
n 

Because  of  the  coarse  quantization  of  the  delay  error  signals,  and 
because  so  many  assumptions  had  to  be  made  to  fill  in  for  unknown  para- 
meter values,  the  preceding  results  are  quite  tenuous.  While  no  strong 
final  conclusions  can  be  made  about  the  suggested  methods,  the  results 
do  seem  hopeful. 


-r~ 
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5.2  A Preliminary  Look  at  Kalman  Filter  Residuals. 

A digital  tape  of  some  Kalman  filter-related  results  from  Accep- 
tance Test  Plan  48  was  received  from  Collins  Radio  during  the  final  week 
of  this  study.  This  is  the  source  of  the  data  used  here. 

The  period  of  time  from  4863  sec.  (Nav  time)  to  5043  sec.  will  be 
examined.  During  this  period  several  interesting  events  occur.  At  4888 
sec.  the  range  residual  in  Channel  1 suddenly  exceeds  the  6-sigma  test 
and  is,  therefore,  not  used  by  the  filter  for  the  succeeding  period  of 
time.  At  4978  sec.  the  residual  of  Channel  2 also  exceeds  the  6-sigma 
limit  and  is  thereafter  ignored  by  the  filter.  At  5038  sec.  the  re- 
sidual of  Channel  1 drops  back  below  the  6-sigma  threshold  and  there- 
after stays  far  within  that  bound.  Channel  2 stays  outside  the  bound. 
During  this  period.  Channels  3 and  4 have  small  residuals  that  never 
come  close  to  their  6-sigma  bounds. 

The  system  is  operating  in  the  non-coherent  mode.  The  user  is 
stationary  and  the  nominal  signal  strength  is  -169db.  There  is  61db  of 
jamming  power  during  the  entire  time  selected  above  for  analysis. 

Within  the  above  time  range,  35  data  points  were  available  for  each 
channel.  One  sample  is  given  every  five  seconds,  except  that  the  data 
for  4943  sec.  and  4958  sec.  are  missing  for  some  unknown  reason. 

The  first  obvious  impression  from  the  residual  data  is  that  none  of 
the  four  channels  look  like  zero  mean  random  processes  that  a Kalman 
filter  should  theoretically  produce.  In  fact,  Channel  1 has  only  2 sign 
changes  (3  runs)  out  of  35  points.  Channel  2 has  no  sign  changes  (1 
run),  Channel  3 has  4 sign  changes  (5  runs)  and  Channel  4 has  1 sign 
change  (2  runs) . There  is  no  point  doing  the  complete  runs  test  com- 
putation with  this  type  of  data.  Based  on  results  of  Section  5.1  it  is 

felt  that  the  sample  mean  should  always  be  subtracted  at  first.  For 

— 2 
Channel  4,  this  mean  is  s * -5.1287m,  and  the  sample  variance  is  S * 

2 — 

41.77m  . Using  s-s,  it  is  found  that  Channel  4 has  4 transitions  (U*5 
runs),  and  ^*18.486  and  oy  ■ 2.91.  The  test  statistic  is  z * -4.63. 
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This  means  that  if  these  residuals  did  come  from  a random  process  of 
mean  s,  then  a very,  very  unusual  4.63  sigma  event  has  occurred.  In 
fact,  Channel  4 residuals  are  not  random,  but  systematic.  That  is, 
hypothesis  Hq  of  Appendix  A is  selected.  Deciding  that  loss-of-lock  has 
occurred  is  an  incorrect  decision.  There  are  other  causes  for  the  non- 
random  residual  behavior.  Since  the  magnitudes  of  all  Channel  4 resi- 
duals are  quite  small,  this  situation  corresponds  to  occurrance  3 of 
Figure  A-4  (small  error,  but  not  random). 

An  analysis  of  Channel  3 shows  much  the  same  thing.  That  is, 

— 2 2 — 

s=-l. 2504m  and  S “15.382m  . Then  (s-s)  has  5 runs,  and  coincidentally, 
the  same  results  as  Channel  4,  n^“17,  n^™^.  Thus,  the  residuals 
of  Channel  3 are  also  small,  but  not  random. 

T 

The  digital  test  tape  also  lists  the  theoretical  variance  HPH  +R 

from  the  Kalman  filter  for  each  channel.  For  Channels  3 and  4,  this 

theoretical  variance  is  almost  constant  during  the  time  of  interest. 

2 2 

Values  range  from  87  to  92m  . Of  this  total,  75m  is  due  to  the  input 

value  for  R used  by  Collins  in  the  non-coherent  mode.  Two  conclusions 

can  be  drawn  from  this:  (1)  any  test  (such  as  the  Chi-square  test) 

which  depends  on  a larger  than  theoretical  variance  to  signal  loss-of- 

2 

lock  is  not  likely  to  work  very  well  because  the  sample  variance  S is 

so  much  smaller  than  the  theortical  values.  That  is,  the  residuals 

really  do  not  have  the  statistical  properties  that  were  assumed  for  them 

when  setting  up  the  mathematical  models  for  hypothesis  testing.  An  R 

value  of  75  is  far  larger  than  reality;  (2)  with  R“75,  it  follows  that 
T 2 

HPH  is  on  the  order  of  12  to  17m  . It  can  be  concluded  that  an  order 
of  magnitude  basis,  the  Kalman  gain  for  these  channels  is  about  .14  to 
.18.  This  means  that  the  filter  update  is  making  very  small  corrections 
to  the  state  estimates.  Rather,  82  to  86%  of  the  weight  of  the  updated 
state  estimate  is  placed  on  the  propagated  value  of  the  last  estimate. 
Only  14  to  18%  of  the  weight  is  placed  on  information  gained  from  the 
GPS  measurement. 


A closer  look  at  Channels  1 and  2 is  In  order.  They  will  also  fail 
the  randomness  test  (runs  test)  but  this  is  not  very  significant  in  view 
of  the  fact  that  even  the  "good"  channels  fail  this  test.  The  random 
walk  model  of  loss-of-lock  is  clearly  not  applicable  to  the  data  of  this 
case.  The  reason  is  that  in  both  cases  where  the  residual  exceeds  its 
6-sigma  limit  (suddenly)  it  is  caused  by  the  deterministically  computed 
"ionospheric  correction".  For  example,  at  4883  sec.  the  Channel  1 range 
residual  is  -0.193m  and  the  ionospheric  correction  is  17.872m.  At  4888 
sec.  (time  that  residual  first  exceeds  6-sigma)  the  residual  jumps  to 
60.193m  and  the  ionospheric  correction  is  77.963m,  or  60.151m  larger 
than  the  previous  correction.  The  correction  prior  to  4883  sec.  had 
also  been  exactly  17.872m  at  least  as  far  back  as  4753  sec. 

A similar  jump  in  the  Channel  2 ionospheric  correction  (from 
14.49lm  to  77.286m)  at  4978  sec.  accounts  for  the  range  residual  jump 
from  12.282m  to  74.604m.  The  jump  is  62.795m  in  ionospheric  correction 
and  62.322m  in  range  residual. 

Clearly,  the  "corrections"  caused  the  so-called  loss-of-lock  as 
judged  by  the  6-sigma  reasonableness  test.  This,  plus  the  fact  that  the 
filter  residuals  are  not  random,  do  not  have  zero  means  and  have  vari- 
ances far  smaller  than  the  filter  is  told  to  expect,  makes  further 
testing  of  this  data  of  academic  interest  at  best. 

The  reason  for  the  non-randomness  of  residuals  is  the  long  receiver 
time  constant  when  faced  with  jamming  in  the  non-coherent  mode.  Measure- 
ment errors,  and,  hence,  filter  residuals,  are  strongly  time  correlated 
with  correlation  times  on  the  same  order  of  magnitude  as  receiver  time 
constants  (20  to  30  seconds).  This  correlation  is  not  modeled  into  the 

filter.  Instead,  the  value  of  the  measurement  noise  variance  R is  in- 
2 2 

creased  from  25m  to  75m  to  de-weight  the  measurements  and  as  an  ad  hoc 

means  of  coping  with  correlations.  This  increase  in  R is  also  the 

2 2 

reason  why  sample  variances  (S  of  15  to  42m  ) are  so  much  smaller  than 

2 2 

the  filter  predicts  (a  of  87  to  92m  ). 
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In  summary,  non-random  residuals  do  not  necessarily  indicate  loss- 
of-lock,  but  rather  suboptimal  filter  modeling.  Small  residual  vari- 
ances do  not  guarantee  good  tracking  but  may  indicate  an  excessively 
large  standard  of  comparison  due  to  non-optimal  filter  parameters. 

6.0  Conclusions  and  Recommendations 


Methods  of  detecting  loss-of-lock  in  the  Generalized  Development 
Model  of  the  GPS  receiver  have  been  developed.  Some  tests  of  the 
methods  on  available  data  have  been  presented.  Promising  or  hopeful 
results  were  obtained  when  the  delay  lock  loop  error  signal  was  used, 
although  only  a few  test  cases  were  analyzed. 

When  the  methods  were  tested  on  Kalman  filter  residuals,  the  re- 
sults were  not  very  satisfactory  because  the  filter  is  of  suboptimal 
design.  As  a result,  the  residuals  bear  little  resemblance  to  their 
theoretical  model.  This  leads  to  results  which  are  contradictory  to 
expectations.  In  Section  2.3  it  was  stated  that  filter  covariances 
would  be  overly  optimistic.  After  seeing  the  data,  late  in  this  study, 
it  is  clear  that  at  least  the  residual  covariances  are  far  too  pessi- 
mistic. Unmodeled  receiver  states,  and  the  resulting  correlated  mea- 
surement errors  are  probably  the  major  sources  of  trouble.  In  some  ways 
the  ad  hoc  increase  in  noise  covariance  R compounds  the  problem  of  using 
residuals  to  detect  loss-of-lock,  since  this  artificially  causes  a 
great  discrepancy  between  actual  and  theoretical  residual  variances. 

The  reason  for  increasing  R is  to  de-weight  measurements  somewhat 
in  the  non-coherent  mode.  It  could  be  that  they  are  de-weighted  too 
much,  to  the  point  where  the  GPS  signals  and  the  Kalman  filter  are 
having  a very  minor  effect  on  state  estimates. 

It  has  been  determined,  at  least  in  the  few  cases  reported  here, 
that  "loss-of-lock"  as  defined  by  a filter  residual  exceeding  6-sigma, 
is  actually  caused  by  a sudden  jump  in  the  ionospheric  correction  term. 
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A reasonableness  check  on  these  corrections  must  be  added.  This  should 
be  very  easy  to  do.  The  ionspheric  corrections  remain  constant  over 
relatively  long  periods,  and  any  sudden  drastic  jump  should  be  edited 
out.  In  fact,  this  correcion  term  may  have  value  as  an  indicator  of 
jamming. 

The  recommendation  of  Section  2.2  is  reiterated.  The  feedback  to 
the  IMU  should  be  disabled  at  the  time  of  a switch  over  to  the  non- 
coherent mode.  The  measurement  difference  signal  Az  defined  in  Section 
3.1  should  be  investigated  for  its  suitability  in  detecting  loss-of- 
lock.  The  methods  of  this  report  are  suggested.  Lack  of  suitable  data 
prevented  the  Az  signal  from  being  tested  here. 

A combination  of  methods,  such  as  a test  for  randomness  and  a test 
on  magnitude  or  a Chi-square  test  on  variances  may  be  the  best  pro- 
cedure. 

A final  suggestion  for  further  consideration  is  that  the  time 

2 

average  approximation  for  the  residual  variance,  called  S in  the 
report,  could  prove  useful  in  a simple  adaptive  method  of  adjusting  the 
value  of  R in  real  time. 
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Appendix  A 


Runs  Test 

In  order  to  test  the  hypothesis  that  a given  sequence  of  signal 
samples  is  random,  as  opposed  to  having  some  deterministic  trend  in  it, 
a "runs  test"  can  be  used,  as  follows.  Consider  a sequence  of  signals 
s(k)  for  k-1,  2,  . . .,  N.  The  hypotheses  are: 


s(k)  is  not  a zero  mean,  random  sequence 
s(k)  jLs  a zero  mean,  random  sequence. 


(For  a known,  non— zero  mean  m(k) , a test  for  randomness  can  still  be 
applied  by  using  s^k)  = s(k)  - m(k) , that  is  by  constructing  a zero 
mean  process  first.)  Heuristic  motivation:  If  s(k)  satisfies  H^,  then 
one  would  expect  the  samples  to  change  sign  in  some  random  pattern.  Too 
many  sign  changes,  like  every  sample  or  two,  might  indicate  that  (s(k)} 
is  a high  frequency  deterministic  oscillation.  Too  few  sign  changes 
might  indicate  another  type  of  deterministic  trend,  e.g.,  just  one  sign 
reversal  might  indicate  a ramp  or  parabolic  signal  behavior. 


Definition:  A "run"  consists  of  one  or  more  consecutive  samples  of  the 
same  sign.  The  number  of  runs  in  a string  of  N test  samples  will  be 
denoted  as  U,  and  under  , U will  be  random  variable.  The  test  stati- 
stic U has  a density  function  which  asymptotically  approaches  the 
gaussian  density  as  the  size  of  the  sample,  N,  increases.  In  practice, 
if  N exceeds  25  or  30,  the  gaussian  approximation  may  justifiably  be 
used.  This  assumption  will  be  used  here.  The  mean  and  standard  devi- 
ation of  the  random  variable  U are  ^ 
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2n-.ru  (2n,n 


N (N-l) 


where 


n^  =*  number  of  times  s(k)  is  positive 
in  N samples 


= number  of  times  s(k)  is  negative 
in  N samples 


N = n1  + n2 


A sketch  of  the  density  function  f(U),  assuming  is  true  is  shown 


below 


f(u|Hx) 


Fig.  A-l  Density  Function  for  the 
Test  Statistic  U 

If  ^ is  indeed  true,  then  on  a given  test  of  N samples,  the  number  of 
runs  U would  be  expected  to  fall  between  the  limits  [a,  b]  with  a pro- 
bability which  can  easily  be  computed  for  specified  values  of  a and  b. 
If  U exceeds  b or  is  less  than  a,  then  either 


(1) 


does  not  hold  and,  therefore,  Hq  is  choosen,  or 

(2)  A very  rare  probabilistic  event  has  occurred. 

In  practice,  one  would  carry  out  the  following  procedures: 

(1)  Select  N samples  and  observe  n^,  n^,  and  U. 

(2)  Compute  and  c^. 

(3)  Normalize  to  allow  use  of  zero  mean,  unit  variance  normalized 
tables,  i.e.,  z = (U  - Uy)/Oy  becomes  the  test  statistic. 

(4)  Select  the  desired  confidence  level.  This  establishes  how 
large  the  areas  are  in  the  tails  of  the  curve  of  Figure  A-l,  which 
establishes  the  rejection  threshold  for  the  test.  For  example,  if  a 
probability  of  .05  is  used  to  define  what  is  meant  by  "a  very  rare" 
event,  then  we  want  95%  of  the  area  under  the  curves  of  Figure  1 to  be 
between  a and  b.  Selecting  a and  b as  symmetric  with  respect  to  the 
mean  leads  to  a threshold  level  for  z of  about  two  sigma  (1.96  actu- 
ally). This  means  that  if  is  true, 

Pr  (-1.96  <_z  < 1.96)  = .95 

Thus,  if  the  computed  value  for  z in  item  3 satisfies  -1.96  <_  z <_  1.96, 
then  there  is  no  reason  to  reject  and  accept  Hq.  On  the  other  hand, 
if  z is  outside  this  range,  the  conclusion  is  that  Hq  applies,  i.e., 
(s(k)}  is  not  random.  We  accept  the  chance  of  being  wrong  5%  of  the 
time. 

A possible  implementation:  If  the  samples  s(k)  are  encoded  as  1 if 
s(k)  >_  0 and  0 otherwise,  then  an  N-bit  shift  register  could  be  used  to 
maintain  the  most  recent  N samples  on  which  the  test  for  randomness  will 
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be  applied.  The  algebraic  sum  of  the  bits  in  this  shift  register  is  n^. 
N-n^,  gives  n^,  and  the  value  of  U can  be  determined  by  counting  the 
number  of  transitions  from  0 to  1 and  from  1 to  0.  The  number  of  runs 
U and  the  number  of  transitions  t are  related  by 


U 


t + 1 


Thus,  all  quantities  needed  for  the  runs  test  are  easily  obtained. 

As  an  illustrative  example,  the  velocity  residuals  from  a two-state 
extended  Kalman  filter  simulation  with  range-rate  measurements  are  used 
as  the  test  signal  s(k) 


Run  //I 


k 

s (k) 

0 

-.02 

.5 

-3.34 

1.0 

-.49 

1.5 

-2.91 

2.0 

-2.67 

2.5 

-.80 

3.0 

-.67 

3.5 

.65 

4.0 

-.76 

4.5 

.39 

5.0 

-1.67 

5.5 

-.51 

6.0 

.99 

6.5 

3.6 

7.0 

2.77 

7.5 

1.61 

8.0 

-.15 

8.5 

-.62 

9.0 

-1.15 

9.5 

-1.86 

10.0 

-.96 

2 

3 

4 

5 

6 


U = 7 runs,  transitions 
N - 21  samples 


6,  n. 


15 


Table  A-I  Some  Example  Statistics,  Showing  the  Runs 
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The  question  is,  are  these  residuals  random  (i.e.,  is  the  filter  working 
properly)  or  is  there  a systematic  trend,  due  to  a modeling  error  or 
some  other  cause? 


The  runs  test,  at  the  5%  level,  is  applied. 

2(6) (15) 

My  - 21  + 1 = 9.57 

Oy  - j 3.24  = 1.8 

” the  test  statistic  is 

z = (U  - = -1.4 

Since  -1.96  z <_  1.96,  there  is  no  justification  for  saying  that  this 
sequence  is  non-random  (even  though  there  are  several  long  strings  of 
consecutive  negative  residuals) . 

Note  that  the  runs  test  does  not  require  any  knowledge  of  the 
statistics  of  the  signal  samples  s(k)  beyond  the  assumption  of  zero 
mean.  Therefore,  this  test  could  be  applied  to  any  receiver  signal 
which  is  supposed  to  be  random  when  the  code  is  being  tracked ; for 
example,  the  delay  error  in  the  delay  lock  loop.  The  advantage  of  using 
this  signal  as  opposed  to  the  Kalman  filter  pseudo-range  residuals  is 
the  availability  of  data  at  a higher  rate.  Thus,  N samples  can  be 
tested  in  a relatively  shorter  time. 

Disadvantages  associated  with  the  "runs  test"  approach  to  detecting 
loss-of-lock  are: 

(1)  It  may  indicate  that  the  test  signal  is  non-random  because  of 
a relatively  small  and  tolerable  system  bias  as  sketched  in  Figure  A-2. 
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Fig.  A-2  A Biased  Signal 

(2)  It  may  indicate  that  a wildly  erratic  signal  is  indeed  random 
although,  in  fact,  the  error  levels  are  so  large  as  to  preclude  any 
useful  processing  of  the  signal.  Figure  A-3  represents  such  a sample. 


Fig.  A-3:  A Random  Signal  With  Wild  Fluctuations 

Both  of  these  possible  defects  could  be  overcome  by  combining  a mag- 
nitude threshold  test  with  the  runs  test.  Figure  A-4  shows  how  such  a 


combination  might  be  used: 


(1)  yes 


H. : Random  Thresholdr 

Test 


ls(k)llT  (2)  no 


8 (k)  Runs 

I *■  Test 
samples 


good  (locked  on) 


bad  (random  erratic 
range  errors 


still  O.K.  small 

H : Not  Random  frhresholc y6S  bias.  May  be  in- 
o rj,  . dication  of  be- 

-■  ■■  lest  ginning  of  problems 

I s (k)  ! <r 

—■■J  n°  divergence 

(loss-of-lock) 


Fig.  A-4:  Combined  Runs  Test  and  Threshold  Test 


Possible  outcome  (1)  indicates  proper  behavior;  therefore,  the  system  is 
locked-on.  Outcome  (4)  indicates  pretty  clearly  that  loss-of-lock  has 
occurred.  Outcomes  (2)  and  (3)  are  slightly  more  ambiguous,  but  (2) 
probably  indicates  loss-of-lock  and  certainly  indicates  unacceptable 
system  operation,  for  whatever  reason.  Outcome  (3)  indicates  that  the 
system  is  probably  still  working  satisfactorily.  This  could  mean  there 
is  some  small  system  bias,  but  it  might  also  indicate  the  beginning  of  a 
slow  divergence  and  the  situation  should  continue  to  be  carefully 
monitored. 


Appendix  B* 


Sequential  Likelihood  Ratio  Test  for  Detecting  Loss-of-Lock 

The  problems  being  considered  is  the  determination  of  whether  the 
measurement  (receiver  output)  is  a valid  pseudo-range  measurement  or 
some  sort  of  random  walk  because  loss-of-lock  has  occurred.  That  is, 
one  of  two  alternative  hypothesis  must  be  selected. 


or 


H : z ■ w + v 
o 


4 rT 


(B-l) 


where  v is  random,  white  zero  mean  noise  with  variance  R and  w is  a 
random  walk  variable. 


w - 


n(t)dT  + rp(o) 


(B-2) 


The  noise  n is  also  white,  zero  mean  with  variance  an  (latter  assumed 

constant)  and  rp(o)  is  the  pseudo-range  which  existed  at  the  time  of 

loss-of-lock.  Also,  t^  is  the  time  bias,  c is  the  speed  of  light,  x is 

the  system  state  vector  and  H is  the  linearized  measurement  matrix.  If 

z is  assumed  guassian,  then  its  density  function  is  determined  by  its 

first  two  moments.  First  consider  the  situation  when  H is  true.  Then 

o 


t 

E{z}  ■ E{w}  + E{v}  ■ J' 

o 


E{n}dr  + E{r  (o) } + E{v> 
P 


(B-3) 


*In  this  appendix,  a linear  relationship  between  state  and  measurement 
is  used  only  for  simplicity  of  exposition. 


Since  n and  v have  zero  means 


E(z}  - E{r  (o)}  = H(0)  x(0) 

P 

_ 2 

Also,  cov{(z)}=*  E{ [z  - z]  },  but 

t 

z - zf  ■ J'  n(x)dx  + v + H(o)  [x(o)  - x(°)l 
o 

so  that  under  hypothesis  Hq 

cov  (z)  = H(o)  cov  [x(o)]  H^(0)  + R + a 2t  (B-4) 

The  time  o is  being  used  to  indicate  the  point  where  loss-of-lock  oc- 
curred, and  not  some  initial  problem  time.  Also,  note  that  uncondi- 
tioned expectations  have  been  used  above,  so  that  cov  [x(o)]  is  not 
P(o|o)  or  P(o|-l),  the  conditioned  covariances  of  the  Kalman  filter. 
However,  we  will  ultimately  switch  to  a recursive  formulation  and  then 
it  will  be  seen  that  quantities  from  the  Kalman  filter  will  become 
involved . 

Now  consider  to  be  true.  Then 

E{z}  - E{rp(t)}  - to(t)  (B-5) 

and 

cov{z}  * H(t)  cov  [x(t)]HT(t)  + R (B-6) 

The  two  possiiblities  Hq  and  lead  to  two  separate  density  functions 
f (z | Hq)  and  f (z | H^) . Under  the  gausslan  assumption,  the  above  means  and 
covariances  suffice  to  completely  specify  these  density  functions. 

Define  the  likelihood  ratio  as 

f(z  Hx) 

L1  £ f (z  Hq)  (B-7) 


34 


Figure  B-l:  The  Two  Possible  Density  Functions 


Clearly  at  z~  we  have  L'*’  > 1 and  a maximum  likelihood  decision  rule 
would  select  alternative  in  this  case. 

Actually,  the  decision  is  to  be  based  on  a sequence  of  measure- 
ments, 2 (k)  ■ {z(l),  z(2),  . . .,  z(k)}  rather  than  on  a single  measure- 
ment z(k).  Thus,  a modified  likelihood  ratio  is  defined  as 

, A (B-8) 

f(2|Ho) 

Note  that  f(2(k)|H1>  =>  f^k)!^,  2(k-l))f  (2(k-l)  |h  ) . This  can  be 
repeatedly  factored,  leading  to 

f(2(k)|Hi)  - f(z(k)|Hi,  2(k-l))f(z(k-l)|Hi,  2(k-2)  . . .* 

*f(z(2)|Hi,  2(l))f(z(l)|H1) 
k 

* f(z(l)|H  ) ir  f (z(j)  |H, , 2(j-l))  (B-9) 


The  same  form  holds  for  both  1=0  (Hq)  and  1=1  (H^) . 


The  natural  log  of  L(k)  is  defined  as  A(k),  and  can  be  written  as 
A (k)  = ln{L(k) } 

= In  [f (3(k)  lH±)]  - In  [f (a(k) |Hq) ] 

= In  (z (1) | Hx)  - In  (z(l)|Ho) 
b 

+1  [In  { f (z  (j ) | H- , a(j-l)) } - ln{f(z(j)|H  , 3(j-l))}] 
j=2  x 0 

= A(k-l)  + lntfCzCk)!^,  a(k-l))} 

- In  {f(z(k)|HQ,  a(k-l)) } (B-10) 

If  the  original  density  functions  are  gaussian,  then  so  are  all  the 
densities  conditioned  on  past  measurements.  The  results  of  Eq  (B-3) 
through  Eq  (B-6)  must  be  modified  slightly  to  obtain  the  moments  for 
f(z(k)|H^,  a(k-l))  for  example.  The  modification  consists  of  using 
E{ • | a(k-l)}  instead  of  the  unconditioned  expectations  used  earlier. 
Thus,  the  former  x terms  become  E{x(k) | a (k— 1) },  which  is  the  usual 
Kalman  estimate  ft(k|k-l).  Also,  cov  [x]  terms  become  cov  [x(k) [ a(k-l) ] 
which  is  the  usual  Kalman  filter  covariance  P(k|k-1).  These  manipu- 
lations lead  to 

exp  (-1/2  [z(k)-H^(k|k-l)1TrHPH+R]~1rz-Hx]} 
f(z(k)|H1,  a(k-l))  = (2Tr)m/2|HP(k|k-1)HT+R|l/2 


Since  each  receiver  channel  is  considered  separately,  m=l,  z is  a scalar 
and  the  above  expression  simplifies  to 


exp  {-1/2  f z-HX(kjk-l| ~/fHPH+R]  ■ 

£(z(k)  IH,  a(k-l)  - m [HP(k|k-l)HT«I1/2  <B'n> 

When  the  same  kinds  of  manipulations  are  applied  to  f(z(k)|HQ, 

Z(fc-l)),  the  result  is 

f (z (k) | Hq , Z(k-l)) 

exp  {-l/2[z(k)-Hx(o  |-1)  ]2  /[H(o)P(o  |-l)HT(oHR+an2(tk-to)  ] } 
VT?  {H(o)P(o|-l)HT(o)+R-hjn2(tk-to)}  1/2  (B-12) 

Note:  t is  used  to  indicate  the  time  loss-of-lock  occurs,  and  not  some 
o 

initial  time.  It  could  be  any  time  in  the  range 


Taking  natural  logs  gives  the  log-likelihood  ratio  in  a recursive  form 
A(k)  = A(k-l)  + 1/2  In  {H(o)P(o|-1)HT(o)  + R + (tfc-to) } 

-1/2  In  {H(k)P(k|k-l)HT(k)+R} 

+ 1/2  [z(k)-HR(o|-l)]2/[H(o)P(oj-l)HT(o)+R-K7n2(tk-to)]> 

- 1/2  [z(k)-HR(k|k-l)]2/[H(k)P(k|k-l)HT(k)+R]  (B-13) 

An  initial  value  A=0  starts  the  process.  This  says  that  initially 
the  presence  or  absence  of  a valid  pseudo-range  signal  is  equally 
likely.  The  quantities  P(k|k-1),  z(k)  and  k(k|k-l)  are  the  usual  cur- 
rent values  of  covariance,  measurement  and  state  estimate  from  the 
Kalman  filter.  The  same  quantities  with  a zero  time  argument  in  place 
of  k,  apply  at  the  time  that  loss-of-lock  is  presumed  to  have  occurred. 


(B-14) 


In  order  to  use  A (k)  in  a sequential  hypothesis  test 
thresholds  tq  and  must  be  specified.  Then; 


if  A(k)  In  (tq)  accept  Hq 
if  A(k)  >_  In  (t^)  accept 

if  In  (tq)  < A(k)  < In  (t^)  continue  testing, 
no  decision  yet. 


The  decision  thresholds  are  selected  as  follows:  Define  P.,  as  the 
probability  of  miss  (P^  =*  Probability  that  hypothesis  Hq  is  accepted 
when,  in  fact,  is  true;  i.e.,  we  miss  the  fact  that  a valid  range 
measurement  is  present) . Also  define  a probability  of  false  alarm  P 

r 

(P_,  = Probability  that  H,  is  accepted  when  in  fact  H is  true;  i.e.  it 

r L O 

is  falsely  concluded  that  a valid  range  signal  is  present) . Acceptable 

numerical  values  must  be  specified  for  Pw  and  P„.  These  are  then  used 

M F 

to  find  the  necessary  thresholds 


o 1-PT 


(B-15) 


Figure  B-2  illustrates  the  use  of  the  sequential  likelihood  ratio  test, 


ln(Tx)  


ln(to) 


Figure  B-2  A Possible  Sequence  of  A(k)  Values 


For  the  situation  shown  in  Fig.  B-2,  the  two  alternate  hypotheses  are 
that  loss-of-lock  has  occurred  at  time  0 and  that  loss-of-lock  has  not 
occurred.  The  first  six  samples  yield  no  conclusion,  since  neither 
threshold  is  crossed.  Only  at  sample  7 is  a definite  decision  made,  and 
in  this  case  it  is  that  is  true,  loss-of-lock  has  not  occurred.  On 
another  set  of  sample  measurements,  A(k)  might  cross  the  lower  threshold, 
in  which  case  Hq  would  be  accepted;  that  is,  it  would  be  decided  that 
loss-of-lock  did  occur  at  time  0. 

In  the  GPS  receiver  problem,  complicating  issues  arise  regarding 
the  above  procedure.  First,  two  simple  alternatives  do  not  really 
describe  the  true  situation.  In  relationship  to  Figure  B-2,  for  ex- 
ample, the  questions  of  whether  loss-of-lock  occurred,  (not  at  k*0,  but 
at  k=l  or  2 or  3 or  4 or  5 or  6,  . . . have  not  been  considered.  The 
preceding  developments  assume  that  a single,  known  time  k=0  is  being 
tested  for  loss-of-lock.  How  is  such  a time  selected  for  test,  and  what 
about  the  other  possible  times?  It  would  be  interesting  to  test  the 
sequential  decision  procedure,  via  simulation,  in  cases  where  the  time 
loss-of-lock  occur  can  be  controlled.  These  simulated  tests  could  prove 
or  disprove  the  feasibility  of  the  sequential  likelihood  ratio  test  and 
could  shed  light  on  how  long  it  takes  before  loss-of-lock  can  be  de- 
tected in  this  idealized  case. 

A more  pragmatic  approach,  using  Eq  (B-13)  of  this  appendix  but  in 
a non-recursive  manner  on  a fixed  set  of  data  points,  is  suggested  in 
Appendix  C. ' 
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A Nonsequential  Likelihood  Ratio  Test  For  the  GPS  Receiver 

Each  channel  of  the  GPS  receiver  produces  an  output  once  per 
second.  These  are  sampled  and  processed  by  the  Kalman  filter  once  every 
five  seconds.  Four  out  of  each  five  receiver  outputs  are  not  used  at 
present.  It  is  suggested  here  that  all  receiver  outputs  be  used  in 
detecting  loss-of-lock.  Let  kK)  represent  a receiver  output  which  is 
sampled  for  use  in  the  filters.  The  next  sample  which  the  filter  will 
process  is  k=5,  as  shown  below. 


Filter  Filter 

Sample  Sample 


0 1 2 3 4 5 


Receiver  Outputs  at  these  times 

In  deciding  whether  or  not  z(0)  is  a valid  pseudo-range  measurement,  the 
five  measurements  a - {z(0),  z(l),  z(2),  z(3),  z(4)}  can  be  used  in  a 
non-re cursive,  fixed  data  span  mode  as  described  below.  Then,  the  whole 
process  can  be  repeated  using  2 = {z(5),  z(6),  z(7),  z(8),  z(9)}  for  the 
next  filter  cycle,  and  similarly  for  succeeding  cycles.  In  this  mode  of 
operation,  the  two  alternate  hypothesis  of  Appendix  B are  valid  and 
appropriate  at  each  step  (since  a decision  will  bo  made  about  loss-of- 
lock  for  every  Kalman  filter  input).  Eq  (B-13)  for  computing  A(k)  is 
still  valid,  but  now  just  five  sets  of  measurements  z(k)  will  be  used 
and  a decision  is  only  made  once  for  each  set  of  five.  At  the  beginning 
of  each  set  of  five  samples,  is  initialized  to  zero.  In  order  to  per- 
form such  a test,  it  is  clear  that  P(k|k-1)  and  R(k|k-1)  will  be  needed 


► 

l 

l 

I 

[ 
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at  four  times  when  they  would  not  normally  be  used  by  the  Kalman  filter. 


Since  no  Kalman  updates  are  done  between  k=0  and  k*5,  it  is  clear,  for 
example,  that  P (3 | 2)  - P( 3 | 0)  and  k(3|2)  =»  &(3|0).  Thus,  it  appears 
that  all  needed  quantities  can  be  obtained  by  just  interrupting  the 
normal  time  propagation  process  at  the  appropriate  times.  No  extra 
computations  are  required. 


After  accumulating  A for  a complete  set  of  five  measurements,  the 
test  for  loss-of-lock  is 


accept 

A * 0 (C-l) 

accept  H 


Equation  (C-l)  is,  of  course,  entirely  equivalent  to  saying  that  H..  is 
accepted  if  the  likelihood  ratio  L of  Eq  (B-8)  is  greater  than  1 and  Hq 
is  accepted  otherwise. 

• t 


i 
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X Teat  on  the  Signal  Az. 


Given.  N samples  of  Az^,  compute  the  sample  mean 


N 

Az  - 1 Z Az  (D-l) 

N 

i-1 

and  the  sample  variance 

N 

S2  - N Z (Az  - H)2  (D-2) 

i-1  1 

Then,  the  test  statistic 

C - NS^_  • (D-3) 

R 

2 r q i 

has  a x distribution  with  N-l  degrees  of  freedom  1 J . This  distri- 
bution is  sketched  below 


A value  of  Cc  can  be  selected  such  that  the  cross-hatched  area  in 
the  tail  is  the  desired  probability  of  rejecting  when  it  is  actually 
true.  This  probably  is  PM  of  Appendix  B.  For  example,  if  N-5,  then 
5c  -13.28  for  P^-.Ol  and  ? =9.49  for  P^-.OS.  (These  values  loosely 

correspond  to  three-sigma  and  two-sigma  levels  of  confidence  in  the 
gaussian  case.) 


For  a particular  sample  value  for  5 of  Eq  (D-3) ; 


if  C > C , the  decision  is  to  accept  H 
c o 

if  ? £ 5c  , the  decision  is  to  accept 
Numerically,  if  N=5  and  P =.01  then  £ = 13.28  so  that  the  critical 

f\  ii  C 

threshold  for  S is  5 R/N  * 2.66R  = 3R. 

c 

As  a more  concrete  and  complete  example,  consider  the  21  samples 

listed  in  Table  A-I.  The  sample  mean  is  computed  as  -0.40762  and  the 

2 

sample  variance  is  S ■ 2.81384.  Suppose  a probability  of  miss  P =.01 

M 

is  assigned.  Then  from  chi  square  tables,  5 =37.57.  Suppose,  purely 

c 

for  the  sake  of  this  illustrative  example,  that  the  samples  come  from  a 

population  whose  variance  is  R*1  (if  loss-of-lock  has  not  occurred, 

2 

i.e.,  if  H,  is  true).  We  thus  compare  S = 2.81384  with  £ R/N  = 

J-  2 ^ 

37.57/21  a 1.79.  Since  S exceeds  this  threshold,  we  conclude  with  a 

probability  of  0.99  that  the  population  variance  must  be  larger  than 

R=l.  This  means  we  accept  hypothesis  Hq  that  loss-of-lock  has  occurred. 

Note  that  for  this  same  set  of  hypothetical  parameter  values  the 

existing  six-sigma  test  would  not  have  rejected  any  of  the  21  samples. 

They  all  have  magnitude  less  than  6R=6  . In  fact,  only  two  samples 

exceed  a three-sigma  threshold. 

This  sample  was  hypothetical  because  the  value  R=1  was  selected 

only  to  make  a point.  The  21  samples  are  really  Kalman  filter  residuals 

T 

and  their  theoretical  variance  is  of  the  form  HP(k+l  k)H  + R,  which  is 
a time-varying  quantity.  In  the  case  which  generated  the  samples  of 
Table  A-I,  this  residual  variance  was  decreasing  but  had  an  average  of 
about  20  to  25  over  the  time  span  of  interest.  If  this  value  is  used 
instead  of  the  formerly  assumed  value  of  1,  hypothesis  would  be 
accepted. 

2 

Note  that  the  x test  is  suggested  for  the  signal  Az  rather  than 
the  residual  2.  This  is  because  the  variance  of  Az  does  not  depend  on 
the  Kalman  covariance  matrix  P,  but  only  on  the  measurement  noise  co- 
variance  R,  which  is  constant.  See  Section  3.2. 
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are  presented. 
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I.  PROLOGUE 


In  the  past  several  years,  the  need  for  developing  better  cathodes  has 
been  realized.  The  expected  demand  for  cathodes  that  have  the  extreme  life- 
times required  for  satellite  applications,  the  high  current  densities 
necessary  for  radar  and  millimeter  wave  applications,  and  improved  reli- 
ability for  all  applications  has  led  to  increased  research  in  numerous  types 
of  cathodes.  The  Avionics  Laboratory  has  particular  interest  in  nickel 
matrix  cathodes  especially  the  Medicus  cathode.  It  is  hoped  that  these 
cathodes  can  fulfill  the  emission  requirements  of  microwave  tubes  in  the 
near  future.  This  report  describes  a facility  under  development  for  testing 
cathodes  and  investigating  some  of  their  important  properties. 

II.  OVERALL  OBJECTIVES 

After  preliminary  experimentation  and  familarization  with  the  problems 
associated  with  cathode  characterization,  two  related  objectives  were  defined. 

We  decided  to  concentrate  our  efforts  on  improving,  refining  and  in  part  auto- 
mating the  thermionic  emission  facilities  that  were  available  in  this  laboratory. 
These  facilities  are  to  be  used  in  the  normal  characterization  of  cathodes. 

In  addition  they  are  to  be  used  in  conjunction  with  experiments  to  learn  more 
about  the  underlaying  physics  of  the  activation  and  emission  processes  that 
occur  in  the  cathodes.  Along  these  lines  it  was  decided  to  design  an  experi- 
ment to  study  the  poisoning  effects  of  various  gases  on  the  cathodes.  In 
summary,  the  objectives  are 

1.  to  provide  an  improved  facility  for  thermionic  emission  studies 

2.  to  examining  the  effects  of  various  gases  on  the  performance  of 


the  cathodes 
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III . BACKGROUND 

A.  The  Medicus  Cathode 

This  cathode  is  a high  work  function  metal  with  an  interdispersed 
low  work  function  emissive  material.  It  is  designed  to  furnish  high  values 
of  continuous  current  at  relatively  low  temperatures.  By  virtue  of  its 
metal  - nickel  - matrix  it  is  rugged  and  can  be  easily  fabricated  and 
shaped.  Its  fabrication  is  characterized  by  a rolling  and  annealing  cycle. 
Preimpregnated  nickel  and  alkaline  earth  carbonates  are  sintered  on  a nickel 
base  and  rolled  to  about  half  the  thickness  in  several  steps.  Between  each 
step  the  cathode  is  annealed  in  hydrogen. 

To  complete  the  processing  of  the  cathode,  it  must  be  activated.  This 
is  usually  accomplished  in  two  steps.  The  cathode  is  heated  in  vacuo  until 
the  carbonates  are  reduced  to  oxides.  This  is  usually  completed  by  the  time 
the  system  has  reached  800°C  to  850°C.  The  second  step  consists  of  holding 
the  cathode  at  approximately  850°C  and  drawing  current  by  applying  a voltage 
across  the  cathode  and  an  anode  arranged  together  as  a diode.  Sufficient 
voltage  is  applied  so  that  the  diode  is  operating  in  the  transition  region 
between  the  temperature  limited  and  space  charge  limited  operating  regimes. 
For  a good  cathode,  the  current  will  increase  with  time  eventually  reaching 
a fixed  value.  The  voltage  is  then  increased  so  that  the  diode  is  again 
operating  in  the  transition  region.  When  the  current  has  reajusted  to  a 
new,  constant  value,  the  voltage  is  again  increased. 

This  process  is  repeated  until  no  further  increase  in  current  occurs. 
The  cathode  is  activated. 
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B . Thermionic  Emission 


Thermionic  emission  is  the  process  whereby  electrons  escape  from  a 
hot  metallic  surface  into  a vacuum.  If  the  surface  is  used  as  a cathode  and 
all  the  emitted  electrons  are  collected  by  an  anode,  the  cathode  is  said  to 
give  saturated  emission.  The  current  density  in  this  case  is  called  the 
saturated  current  density  Jg.  The  electrons  that  escape  from  a metal  are 
those  whose  initial  energy  was  about  equal  to  the  Fermi  energy.  If  these 
electron  acquire  an  additional  amount  of  energy  <j>,  called  the  work  function, 
they  can  just  escape  the  metal.  The  relation  between  the  saturated  current 
density  Jg  and  this  "barrier  height"  <p  is  given  by  the  Richardson  equation 

2 

T 4iremk  „2  , cf> 

Jso  = -^3—  T exP(-  kT  } 

= A T2  exp  (-  |f) 

where  the  symbols  have  their  standard  definitions.  Quantum  mechanically,  a 
small  fraction  of  the  electrons  incident  on  the  surface  of  the  metal  from 
within  will  be  reflected  so  that  the  equation  above  should  be  multiplied  by 
a transmission  coefficient.  The  theoretical  value  of  A is  rarely  obtained 
in  actual  practice  partly  because  the  work  function  4>  is  temperature  depen- 
dant. A common  practice  is  to  insert  the  theoretical  value  of  A into  the 
equation  and  to  regard  the  equation  as  the  definition  of  the  effective  work 
function  of  the  material  <J)^. 

Once  an  electron  escapes  from  a surface,  it  is  strongly  attracted  back 
to  that  surface  by  the  image  positive  charge.  To  collect  the  electrons  then, 
an  electric  field  must  be  imposed.  For  low  values  of  the  field,  only  a 
fraction  of  the  electrons  are  collected,  the  remainder  forming  a space  charge 
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cloud  near  the  cathode.  One  talks  of  space  charge  limited  emission  in  this 
case.  The  current  density  calculation  must  include  the  contribution  of  this 
space  charge  to  the  potential.  This  requires  the  solution  of  Poissons 
equation  for  the  particular  geometry  used.  An  approximate  solution  for  a 
planar  diode  is 

j 1 .„<§*/«  <2 

j o m .z  . 

a 

This  is  Child's  Law.  V is  the  anode  voltage  and  d the  cathode  - anode 
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spacing.  A plot  of  J vs  V is  often  used  to  verify  diode  performance. 

The  presence  of  an  external  field  counteracts  the  image  charge  force  on 
escaping  electrons.  This  has  the  apparent  effect  of  reducing  the  work  function 
of  the  material.  This  is  the  Schottky  effect  whereby  the  saturated  emission 
current  is  a function  not  only  of  the  temperature  and  work  function  but  of 
the  applied  field.  The  emitted  current  in  the  saturated  region  is  given 
now  by 

t „ , eq  / E , 

Js  = Jso  eXP  [ kT"  1 

where  E is  the  field  strength  and  q is  a geometrical  factor.  This  indicates 
that  the  Richardson  equation  holds  only  for  zero  applied  field.  To  obtain 
effective  work  functions  the  above  equation  must  be  used  to  obtain  the  zero 
field  current  densities  which  are  then  used  in  the  Richardson  equation. 

A number  of  complications  have  been  ignored  in  the  above  discussion. 

Patch  effects,  work  function  differences  for  different  crystal  planes,  semi- 
conducting surfaces  and  surface  states  are  additional  concerns  which  would 
lead  us  far  astray  here.  Suffice  it  to  say  that  the  above  equations  are 
useful  in  organizing  and  comparing  emission  data  for  our  cathodes. 


■*  v •**'  * / , 
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IV.  INITIAL  EXPERIMENTS 


Some  preliminary  experiments  were  performed  on  cathodes  using  the  optical 
and  electrical  measurement  facilities  that  were  available. 

From  a four  point  measurement  of  a piece  of  cathode  material  a resistivity 
value  of  2.3  x 10  ^ft-cm  was  deduced.  This  is  decidely  metallic!  Current 
emission  theories  emphasize  the  semiconducting  nature  of  the  emitting  surface. 
Apparently  this  surface  is  quite  thin  and/or  recessed  with  metallic  nickel 
peaks  making  contact  with  the  measuring  circuit  thus  shorting  out  the  semicon- 
ductor portions  of  the  surface.  An  optical  experiment  was  then  tried.  Eight 
samples  were  subjected  to  ultra  violet  light  while  at  77°K.  No  luminescence 
was  observed.  Three  of  these  samples  were  further  studied  at  4°K  under  illum- 
ination by  a He-Cd  laser  which  radiates  at  3150  X (—3.8  ev) . Again  no  lumin- 
escence was  observed  between  4000  X to  1.2  microns.  This  is  presumably  due 
to  the  irregular  nature  of  the  surface  and  internal  structure  and  a high 
probability  for  non-radiative  transitions.  These  lines  of  inquiry  did  not 
appear  to  be  too  promising  and  so  these  studies  were  discontinued  in  favor 
of  the  thermionic  emission  studies. 

A large  number  of  parameters  are  involved  in  the  cathode  activation  and 
emission  processes.  We  have  made  an  effort  to  improve  our  knowledge  and  control 
of  some  of  these  parameters.  One  of  the  problems  encountered  in  thermionic 
emission  studies  is  accurate  determination  of  the  cathode  temperature  which 
must  be  well  known  for  theoretical  analysis.  The  cathode  is  generally  not 
visible  when  in  operation  and  the  temperature  is  often  inferred  from  optical 
pyrometer  measurements  of  the  nickel  sleeve  upon  which  the  cathode  is  mounted. 
Corrections  must  be  made  for  the  emissivity  of  the  nickel  and  absorption  of 
the  glass  envelope.  Using  optical  pyrometer  measurements  of  cathodes  and 
sleeves  in  combination  with  thermocouple  measurements  we  find  that  the  glass 
wall  reduces  the  apparent  temperature  by  about  four  degrees  C while  the  nickel 


5. 


sleeve  temperatures  are  about  40°C  less  than  the  cathode  temperature.  We  can 
now  make  reliable,  consistent  temperature  measurements. 

V.  MASS  SPECTROMETER 

In  order  to  obtain  more  detailed  information  about  the  processes  occuring 
during  the  activation  and  operation  of  a cathode,  we  have  incorporated  a 
mass  analyser  into  the  system.  An  older  G.E.  model  monopole  partial  pressure 
analyses  was  available.  Its  performance  had  seriously  degraded.  However, 
we  were  able  to  use  the  electron  gun  and  monopole  analyser  portions.  To 
these  we  added  a Kiethley417  High  Speed  Pico-ammeter  to  amplify  the  current 
from  the  analyser,  a Wavetek  146  Multifunction  Generator  to  sweep  the  analyser 
voltage,  a Houston  Instruments  chart  recorder  and  a Tektronics  564  Storage 
Oscilloscope  for  recording  of  the  spectra.  At  high  pressures  (10  ^ . torr  range) 
where  there  are  strong  signals,  we  can  record  a mass  scan  from  0 to  60  Amu  in 
about  90  seconds  with  good  response.  Such  speed  is  useful  when  following  the 
evolution  of  various  gases  during  the  activation  process.  Slower  scans  with 
some  filtering  are  possible  when  signal  to  noise  improvement  is  required.  Two 
examples  are  given  in  Figures  1 and  2.  The  mass  number,  starting  from  zero, 
increases  to  the  right  along  the  abcissa  while  the  signal  amplitude  in  arbi- 
trary units  is  displayed  along  the  ordinate.  The  first  trace  shows  the 
residual  gases  in  a system  that  had  been  pumped  for  a long  time.  The  second 
is  for  the  same  system  some  time  after  the  cathode  in  the  sample  chamber  had 
been  activiated.  The  same  gases  are  present  but  their  proportions  are  dif- 
ferent. Interpretation  of  these  patterns  require  that  calibrations  be  done 
with  the  various  gases  to  obtain  their  cracking  patterns  under  the  existing 
conditions.  Some  calibrations  have  been  done  using  the  vacuum  system  to  be 


described  later. 


We  have  studied  one  cathode  extensively  using  the  mass  analyser.  A 


schematic  diagram  of  the  system  is  shown  in  Figure  3.  A cathode  was 
activated  over  a period  of  a week  and  its  emission  characteristic  studied. 
During  activation,  the  cathode  temperature,  total  gas  pressure  and  gas 
composition  could  be  monitored.  Very  noticable  changes  in  the  gas  composi- 
tion were  observed  during  the  course  of  the  activation  process.  In 
particular,  mass  peak  44  increased  two  orders  of  magnitude  then  decreased 
back  to  its  pre-activation  (residual)  level.  This  seems  to  be  reasonable 
since  carbonates  are  being  broken  down  to  oxides  with  the  evolution  of 
gases  CO  or  CO2.  However  published  mass  spectra  of  CO  2 indicates  that  it 
is  cracked  with  a CO  peak  (mass  28)  being  the  predominant  peak  not  a mass 
44  peak.  This  is  puzzling.  In  addition,  after  the  mass  44  peak  subsided, 
a quite  strong  mass  28  peak  dominated  the  specttrum.  This  is  not  the  case 
for  a residual  gas  spectrum.  This  behavior  can  be  noticed  by  comparing 
Figure  1 with  the  oscilloscope  photographs  in  Figure  4.  The  upper  photo- 
graph was  taken  during  activation.  The  lower  trace  shows  the  strong  peak 
at  mass  44.  (The  upper  trace  is  a 5 x enlargement  of  the  lower  trace) 

The  pressure  in  the  diode  and  temperature  increases,  the  CO2  peak  increases 
and  then  decreases  until  at  930°C  the  composition  of  the  gases  arrears  as 
in  the  upper  trace  of  the  lower  figure.  The  CO  peak  dominates.  The  lower 
traces  in  the  photo  are  for  lower  temperatures  and  pressures  as  the  cathode 
temperature  is  reduced. 

With  the  recent  assembly  of  the  revamped  vacuum  system  shown  in  the 
next  section,  some  calibration  data  has  been  obtained  for  CO,  CO2  and  O2. 
These  are  shown  in  Figures  5,  6,  and  7. 

The  CO  spectra  shows  the  mass  28  - CO  peak  superposed  on  the  residual 
gas  spectra.  The  O2  spectra  show  a strong  O2  peak  at  32  Amu  and  a rela- 
tively strong  0 peak  at  16  Amu.  In  addition  a peak  at  28  occurs  which  must 
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be  in  part  due  to  ^ but  also  is  probably  due  to  some  residual  CO  in  the 
vacuum  system.  (The  CO  spectra  were  taken  before  the  0^  spectra) . The  CO  2 
spectra  exhibits  CO^,  CO,  C and  0 peaks. 

These  calibration  spectra  seem  to  substantiate  the  statement  above  that 
after  the  evolution  of  CO2  during  activation,  here  is  a great  deal  of  CO 
present. 

The  emission  properties  of  the  cathode  were  not  particularly  good  and 
it  seemed  quite  suceptible  to  poisoning.  Some  authors  have  reported  reacti- 
vation of  cathodes  when  operated  in  an  argon  discharge.  This  cathode  did  not 
respond  to  such  treatment. 

A number  of  experimental  problems  became  evident  during  this  experiment. 
The  activation  process  took  almost  a full  week  with  constant  monitoring  and 
recording  of  data.  Overnight  operation  and  more  efficient  data  gathering  is 
needed.  These  and  other  considerations  led  to  the  restructuring  of  the 
system  as  described  in  the  next  section. 

V.  MEASUREMENT  SYSTEM  FOR  THERMIONIC  EMISSION  STUDIES 
A.  Instrumentation 

Initially,  the  current-voltage  characteristics  of  a diode  containing 
a test  cathode  were  obtained  using  the  circuit  configuration  in  Figure  8 . 
None  of  the  data  taking  was  automated.  Data  could  be  read  from  the  meters 
and/or  the  resulting  graph  produced  by  the  recorder  or  oscilloscope  in  the 
case  of  pulsed  operation  of  the  high  voltage  supply. 


Figure  8.  CIRCUIT  DIAGRAM  FOR  OBTAINING  I-V  CHARACTERISTICS  OF  DIODES 


The  next  figure.  Figure  9 , shows  the  system  as  presently  configured. 


The  filament  supply  can  either  be  a constant  voltage/constant  current 
DC  supply  or  a constant  temperature  supply.  This  latter  supply  is  a re- 
designed version  of  a supply  constructed  in  this  laboratory  and  for  resonably 
large  temperature  perturbations  can  maintain  the  temperature  within  ± 1°C. 

This  is  important  since  isothermal  current-voltage  curves  can  best  be  handled 
theoretically. 

The  test  diode  will  usually  be  one  with  a water  cooled  anode  capable  of 
holding  eight  cathodes  for  test.  Diodes  for  specialized  experiments  can  be 
used  or  the  eight  cathode  diode  can  be  reconfigured  for  example,  to  hold  fewer 
cathodes  and  thermocouples  for  direct  temperature  measurement. 

The  voltages  and  currents  that  were  read  from  meters  or  graphs  in  the 
previous  case  can  now  be  fed  into  a data  logging  system  and  can  be  presented 
as  numerical  lists  and/or  as  a graphical  display.  We  can  keep  track  of  the 
many  variables  involved  and  with  the  calculator  can  graph  them  in  various  ways. 
At  present  we  are  using  the  system  on  line.  The  data  is  received,  plotted 
and  not  retained.  An  example  of  the  output  of  the  plotter  is  shown  in 
Figure  10  . The  cathode  in  use  had  gone  through  the  first  activation  step. 

The  figure  records  the  changes  in  the  parameters  during  the  current  activation 
step  for  a period  of  about  seven  hours.  Other  combinations  of  the  parameter 
can  of  course  be  presented  depending  upon  the  program  entered  into  the 
calculator. 

There  are  a number  of  limitations  to  the  system  at  present.  More  memory 
would  be  desirable.  Control  functions  are  lacking.  High  voltage  pulse  data 
can  not  be  handled.  These  latter  two  are  important  limitation.  A relay  boards 
must  be  obtained  or  constructed  that  would  allow  control  of  the  independent 
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Sample/Holder 


Figure  9.  BLOCK  DIAGRAM  OF  MEASURING  SYSTEM 


VOLTAGE  SCALE  (ARBITRARY) 


variables,  temperature,  voltage  etc.,  by  the  calculator  program.  This  would 
allow,  for  example,  for  ovemite  runs  with  suitable  circuitry  for  safeguarding 
the  system  in  the  event  of  unforeseen  circumstances. 

To  handle  pulse  data,  a sample/hold  circuit  has  been  designed  but  not 
constructed.  The  digital  voltmeter  cannot  acquire  readings  in  the  short 
time  available  during  the  voltage  and  current  pulses.  The  sample  and  hold 
circuitry  would  solve  this  problem.  Presently  such  data  is  presented  on  a 
storage  scope  which  does  not  allow  extraction  of  precise  numerical  data  needed 
for  comparison  with  theory.  Pulse  measurements  are  performed  at  the  higher 
voltages  (~  1900V)  to  avoid  excessive  power  dissapation  in  the  test  diode.  It 
is  the  data  at  the  higher  voltages  that  is  most  useful  in  determining  the 
effective  work  function  of  the  cathode. 

B.  Vacuum  System 

A number  of  changes  were  made  to  the  existing  vacuum  system  to  facil- 
itate the  use  of  the  mass  analyser,  increase  the  throughput  and  to  include 
the  capability  of  leaking  gases  into  the  test  diode.  Figure  11  shows  a 
schematic  of  the  system.  The  system  in  this  configuration  has  been  used  to 
introduce  gases  for  calibration  of  the  mass  analyser.  Some  of  the  spectra 
have  been  presented  earlier.  The  sampling  unit  consists  of  three  stainless 
steel  cylinders  and  a manifold  and  valving  for  filling,  purging  and  evacuating 
the  manifold  and  cylinders  as  desired. 

VI.  SUMMARY 

As  this  projact  turned  out,  the  primary  result  was  not  a significant 
increase  in  our  understanding  of  cathodes  but  an  instrumentation  system 
that  allows  easier,  more  complete  data  collection  and  versitile  processing 
and  display  of  the  data.  The  system,  with  the  addition  mentioned,  can  be  used 
for  routine  testing,  life  tests  and  also  for  specialized  experiments  designed 


to  shed  more  light  on  the  physics  of  the  cathodes.  The  experiment  originally 
planned,  but  for  which  there  was  insufficient  time,  was  the  poisoning  experiment. 
A test  vehicle  with  six  cathodes  with  a thermocouple  attached  to  on.e  cathode 
has  been  prepared.  It  remains  to  attach  it  to  the  system,  activate  the  cathodes 
and  to  observe  the  effects  of  the  various  gases  on  the  emission  of  the  cathodes. 
It  is  expected  that  this  experiment  will  be  completed  in  the  near  future. 
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EFFICIENT  FAULT  ANALYSIS  IN  ANALOG  CIRCUITS 
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Abstract 

Fault  analysis  in  analog  networks  is  a form  of  network  parameter 
identification.  The  problem  of  finding  network  parameters  from  measure- 
ments at  the  accessible  terminals  can  be  expressed  as  the  solution  of  a 
system  of  non-linear  equations.  Such  a system  of  equations  is  invariably 
solved  by  a multidimensional  search.  Every  step  of  the  search  requires 
solving  for  the  network  responses  in  terms  of  the  parameters,  comparing 
the  solution  to  the  measured  responses,  and  then  adjusting  the  parameters 
in  such  a way  as  to  produce  a response  closer  to  the  measured  response. 

If  the  network  is  analyzed  by  nodal  equations,  the  computation  of  the  re- 
sponses requires  inverting  a matrix  of  order  equal  to  the  number  of  in- 
accessible nodes.  The  time  for  this  analysis  is  excessive  in  practical 
applications,  and  poses  a major  impediment  to  fault  analysis  in  analog 
circuits.  Using  nodal  equations  for  the  analysis  of  open  circuits  poses 
no  special  problems,  since  an  open  circuit  can  be  represented  by  a zero 
admittance.  Short  circuits,  however,  are  represented  by  an  infinite  ad- 
mittance, which  presents  further  difficulties  in  searching  for  a solution 
when  nodal  equations  are  used. 

We  have  applied  a formula  of  Householder  to  compute  the  response 
of  an  electrical  circuit  with  either  a single  open  or  short  circuit.  In 
the  short  circuit  case  the  matrix  inversion  is  computed  with  about  1/15 
of  the  normal  number  of  multiplications.  In  writing  the  software,  sparse 
matrix  techniques  were  used.  This  reduced  both  the  storage  requirements 
and  the  execution  time.  Examples  are  given. 
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I INTRODUCTION 


It  is  perhaps  surprising  that  although  the  problem  of  fault  iso- 
lation in  analog  circuits  has  been  studied  longer  than  the  comparable 

'1 

problem  in  digital  circuits  , the  digital  problem  is  much  better  under- 
stood, and  automatic  test  equipment  (ATE)  for  isolating  faults  in  digital 
equipment  to  the  chip  and  even  to  the  gate  level  is  available,  but  ATE 
available  for  isolating  faults  in  analog  circuits  is  much  less  sophisti- 
cated[l5-17] . The  explanation  of  this  phenominon  lies  partly  with  the 
fact  that  analog  signals  are  inherently  more  complex  than  digital  signals. 
They  occur  continuously  in  time,  rather  than  at  discrete  times,  and  their 
values  (in  principle)  have  infinite  resolution,  instead  of  being  truncated 
to  a finite  number  of  bits. 

In  the  linear  analog  circuits  discussed  in  this  report,  the  problem 
of  signal  variety  can  be  partly  solved  by  restricting  observation  to  the 
sinusoidal  steady  state,  since  the  system  behavior  to  any  input  can  be 
determined  from  a knowledge  of  the  sinusoidal  steady  state  response  at 
all  frequencies.  Clearly  all  frequencies  cannot  be  considered,  and  the 
problem  of  how  to  choose  a set  of  frequencies  has  only  begun  to  receive 
attention  [13,14],  In  addition,  the  response  at  any  frequency  is  related 
to  the  parameter  values  and  the  network  graph  by  non-linear  equations, 
to  which  the  only  method  of  solution  possible  is  a time  consuming  search. 

In  this  report  we  show  how  the  search  for  single  short  circuits  can 
be  accomplished  efficiently  enough  to  make  the  computation  practical.  We 
also  demonstrate  the  effectiveness  of  this  method  in  identifying  short 
circuit  faults  in  analog  circuits  whose  resistance,  capacitance,  inductance, 
and  gain  values  differ  significantly  from  their  nominal  values. 


^An  extensive  bibliography  of  both  digital  and  analog  fault  isolation 
is  given  by  R.  Saeks  and  S.R.  Liberty  [12]. 
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II  THE  USE  OF  NODAL  EQUATIONS  IN  CATASTROPHIC  FAULT  ANALYSIS 
1.  Formulating  the  y-Parameters 

We  will  consider  linear,  lumped-parameter  networks  having  p+1 
accessible  terminals,  numbered  0 through  p,  as  shown  in  Fig.  1.  The 


Fig.  1 Linear,  lumped  parameter  network  with  p+1  accessible  terminals 


network  of  Fig.  1 contains  n+1  nodes,  including  the  reference  node,  and 
b branches  and  is  assumed  to  contain  no  internal  independent  sources 
but  may  contain  dependent  sources.  We  will  formulate  the  nodal  equations 
using  the  standard  branch  shown  in  Fig . 2 . 

The  passive  element  in  branch  k,  denoted  by  its  admittance  Y^, 
must  be  non-zero,  but  any  of  the  sources  may  be  zero  of  all  may  be 
present,  if  desired.  In  the  equations  that  follow,  v is  the  branch 
voltage  vector,  is  the  branch  current  vector,  e is  the  node  voltage 
vector,  Vg  is  the  independent  voltage  source  vector,  and  is  the 
independent  current  source  vector,  using  the  notation  of  Deso.er  and  Kuh 
[1] . The  dimension  of  each  of  these  vectors  is  bxl.  Considering  the 
kth  standard  branch  shown  in  Fig.  2,  it  is  clear  that  the  branch  v-i 
constraints  are  given  by 


i 


(G^  + Y - Yu)v+  (a  - YR  ) j - Yv_ 


I 
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(1) 


Fig  2 Standard  branch  k 


where  Y * diag(Y.  ) is  the  branch  passive  admittance  matrix,  and  G , u 

K “Til 

a,  and  R are  the  coupling  matrices  due  to  the  controlled  sources.  The 
m 

dimension  of  each  of  these  matrices  is  bxb.  (The  elements  of  matrices 

G and  a are  current  sources,  and  the  elements  of  matrices  u and  R are 
— m — •*=-  — m 

voltage  sources  controlled  respectively  by  voltages  and  currents).  Following 
the  convention  of  Desoer  and  Kuh,  define  the  node-to-branch  incidence  matrix 

A. 

f +1,  if  branch  j leaves  node  i 
{A}  » t -1,  if  branch  j enters  node  i 

J V 0,  if  branch  j is  not  incident  on  node  i 

Solving  (1)  for  j_  and  using  the  Kirchhoff  current  law  * 0 and  the 

T 

Kirchhoff  voltage  law  A e.  = \r,  it  can  be  shown  that  the  network  node 
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equations  are 


AYLATe  = i (2) 

where  is  the  bxb  branch  admittance  matrix  given  by 

Y.  = (1  + TO  - a)_1(Y  - Yu  + G ) (3) 

— b — — m — — ~m 

and  i.  is  the  nxl  vector  of  independent  source  currents 

i - (1  + YR^  - a)"1(Yvs  - j^)  (4) 


Note  that  Yv  is  a Norton  transformation  of  the  independent  voltage 
sources.  Because  of  the  choice  of  standard  branch  shown  in  Fig.  2, 
the  network  can  always  be  modelled  with  current  sources  only. 

In  both  (3)  and  (4)  it  is  necessary  to  invert  the  matrix 
1 + TO  - a in  order  to  formulate  the  node  equations.  Johnson  and 
Pennington  [2]  have  shown  that  if  the  network  can  be  modelled  so 
that  the  branch  current  j of  a branch  which  contains  a current  con- 
trolled source  does  not  control  any  source,  then  the  inverse  of  1 + YR  - a 

— — TO 

is  1 - YR  + a.  Since  most  networks  can  be  modelled  this  way,  forming 
— — m — 

the  node  equations  does  not  present  any  difficulty. 

According  to  the  convention  chosen,  nodes  0 through  p are 
accessible,  and  nodes  p+1  through  n are  inaccessible.  Partition  the 
rows  of  the  node-to-branch  incidence  matrix  A into  accessible  and 
inaccessible  nodes 


(5) 


where  A^  is  the  pxn  incidence  matrix  of  accessible  nodes,  and  A?  is 
the  qxn  incidence  matrix  of  inaccesible  nodes,  and  q = n-p  is  the 
number  in  inaccessible  nodes.  Using  (5)  in  (2)  and  partitioning  i^ 
and  e as  follows 
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e = 


where  and  e^  are  pxl  vectors,  and  and  £ ^ are  vect°rs , we 
obtain 


Wi 


44V 


444  1 Ui 


444*  U> 


Now  eliminate  e^  from  (6)  and  make  use  of  the  fact  that  _i  » (3,  since 
no  independent  sources  are  connected  to  inaccessible  nodes,  to  obtain 

4 ’ *lVi  ' i2T<W2T)"1A24,%Til  (7> 


so  that 


4 ' 44 ~ 4 (444  ) 44  ]4 


is  the  pxp  matrix  of  measurable  y-parameters  defined  at  the  ports 
having  a common  "ground"  node  (node  0 in  Fig.  1).  The  subscript 
i indicates  that  the  y-parameters  are  evaluated  at  frequency  w . 


2.  Fault  Analysis 

Let  ^ be  the  matrix  of  measured  y-parameters,  where  the 
measurement  is  made  at  frequency  A fault  represented  by  a shift 

of  the  network  element  values  from  their  nominal  values  could  be 
identified  by  solving  the  non-linear  equation 


k - b.  <v  w 

for  the  unknown  parameter  values.  If  a single  frequency  is  used, 
as  in  (9),  the  number  of  network  parameters  usually  exceeds  the  number 
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of  independent  y-parameters,  so  that  (9)  has  no  unique  solution.  One 
approach,  suggested  by  Ransom  and  Saeks  [3],  is  to  find  the  solution 
to  (9)  that  minimizes  a norm  |jX^0  “ Y^j| , where  is  evaluated 
from  the  nominal  network  element  values.  Ransom  and  Saeks  give  an 
approximate  solution  to  this  problem  using  linear  methods.  The 
difficulty  with  this  approach  is  that  the  solution  assumes,  roughly 
speaking,  that  the  most  likely  state  of  the  element  values  in  the 
network  is  the  one  which  causes  the  smallest  drift  from  the  nominal 
values  consistant  with  the  measurements.  This  assumption  excludes  the 
possibility  of  catastrophic  faults  (open  and  short  circuits) . 

Another  approach  is  to  augment  (9)  by  measurements  at  a sufficient 
number  of  frequencies  that  a unique  solution  exists.  The  relationship 
between  fault  resolution  and  the  number  of  frequencies  and  measurements 
is  discussed  by  Sen  and  Saeks  [9,  10].  Although  their  algorithm  requires 
choosing  a number  of  frequencies,  it  is  not  clear  how  this  is  done. 

Thus  we  might  define: 

1 • ' * * VT 

T (10) 

2 - • 2k) 
and  solve  the  equation 

i-  zcy  (id 

Nonlinear  equations,  such  as  (11),  are  often  solved  by  defining  a 
norm  c *||£  - ^Jj,  and  then  making  a multidimensional  search  for  min  c. 

The  amount  of  computation  required  by  such  a search  could  be  prohibitive. 
Chen  and  Saeks  [4]  show  that  if  only  one  parameter  is  changed  at  a time, 
the  measurable  system  function  can  be  evaluated  without  the  need  to 
invert  a matrix,  which  is  apparently  necessary  in  (8).  Their  method, 
is  based  on  a formula  given  by  Householder  [5]. 
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In  this  paper  we  show  how  the  Householder  formula  can  be  used 

to  efficiently  analyze  networks  containing  short  circuits  by  considering 

the  change  in  the  network  graph.  The  measurable  effects  of  open 

circuits  and  finite  parameter  changes  can  be  analyzed  by  considering 

changes  in  Y , and  although  this  is  essentially  equivalent  to 
b 

the  method  described  by  Chen  and  Saeks,  it  is  given  here  in  the  context 
of  nodal  analysis  so  that  all  parameter  changes  can  be  analyzed  with 
one  formulation  of  the  network  equations. 

3.  Short  Circuits 

If  two  nodes  of  a network  become  shorted,  the  new  node-to-branch 

incidence  matrix  A'  is  related  to  the  original  incidence  matrix  in  a 

very  simple  way.  If  nodes  i and  j are  shorted,  then  A'  is  formed  by 

replacing  row  i by  the  sums  of  rows  i and  j , and  then  deleting  row  j . 

If  node  i is  the  reference  node,  then  A’  is  formed  by  deleting  row  j. 

These  row  transformations  can  be  obtained  by  premultiplying  A by  a matrix 

R,  each  element  of  which  is  either  0 or  1.  Fig.  3 illustrates  the  row 

transformation  matrix  describing  two  different  short  circuits  (one 

involving  the  ground  node  and  the  other  not  involving  the  ground  node) 

in  a network  with  5 nodes  and  7 branches.  The  incidence  matrices  are 

partitioned  by  the  dashed  lines  under  the  assumption  that  nodes  3 and  4 

in  the  original  network  are  inaccessible.  Thus  the  top  half  of  the 

A matrix  is  A.  and  the  bottom  half  is  A . A similar  partition  is  shown 
■ T ^ 

for  A'  *(A^  j Ap  , but  in  the  transformed  network  some  of  the  formerly 
inaccessible  nodes  may  have  been  made  accessible  (by  virtue  of  their 
having  been  shorted  to  an  accessible  node) . 

The  II  matrix  is  the  "after-to-before"  node  transformation  matrix, 
which  has  a column  for  every  original  node  in  the  network  and  a row 
for  every  node  after  the  fault.  (Since  node  0 does  not  enter  into  A, 
it  does  not  appear  in  R either) . Thus  R may  be  partitioned  by  columns 
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Fig.  3 Effect  of  short  circuits  on  node-to-branch  incidence  matrix,  A. 
(a)  Original  network  graph.  (b)  Short  circuit  between  nodes  2 and  3.~ 
(c)  Short  circuit  between  nodes  2 and  ground  or  0. 

and  rows  into  the  accessible  and  inaccessible  nodes  before  (columns) 
and  after  (rows)  the  fault,  as  shown  in  Fig.  3. 

Denote  the  partitioning  of  R by 


R 


A 
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(12) 


where  the  lower,  left  partition  in  (12)  is  0 since  the  faulted  network 
cannot  have  any  inaccessible  nodes  which  were  accessible  in  the  un- 
faulted network.  In  other  words,  if  a short  circuit  occurs  between  an 
accessible  and  an  inaccessible  node,  the  combined  node  becomes  accessible. 
Thus 


A’  = RA  = 


-11-1  + -12-2 


-22-2 


and  therefore 


-11-1  + -12-2 


A' 

-2 


-22-2 


(13) 


3.1  Forming  the  y-Parameters  of  the  Shorted  Network 

The  y-parameters  of  networks  containing  a short  circuit  can  be 

computed  from  (8)  after  replacing  A by  A'  and  A by  A'.  Define 
AT  112/ 

Yg  = ^2YbA2  ’ *-s  ^xcl  no<*al  admittance  matrix  of  inaccessible 

nodes.  In  the  analysis  that  follows,  we  assume  that  Y^  is  known.  (In 

fact  must  be  computed  only  once  for  any  circuit  using  the  nominal 

element  values,  so  the  cost  of  its  computation  is  a negligible  part  of 

the  total  cost  of  analysis) . 

If  the  short  circuit  occurs  between  two  accessible  nodes,  then 
A^  * Ag,  an<*  new  y_parameters  are  found  from  (8)  with  no  matrix 
inversion.  In  fact,  if  a short  circuit  exists  between  two  accessible 
nodes,  it  can  be  easily  observed  by  making  resistance  measurements 


between  all  pairs  of  accessible  nodes,  so  the  method  described  here 
is  not  needed. 


If  the  short  circuit  involves  any  inaccessible  node,  then  A ' ^ A 

T ^ 2 

and  the  inverse  of  Y^  = A2^L^2  must  computed.  The  main  contribution 

of  this  paper  is  to  show  that  the  inverse  of  Yl  may  be  found  in  terms 


of  with  about  one  fifteenth  as  many  multiplications  as  are  required 
to  find  the  inverse  of  a general  qxq  matrix. 


3.2  Finding  the  Inverse  of  Y^  Efficiently 

From  the  definitions  of  A^»  —2  anc*  —2  Is  eas^  to  s^ow 

that 


—2 


—22^2^22 


Let  us  add  a row  to  as  follows 


(14) 


(15) 


where  u is  a row  unit  vector  containing  all  zeros  except  for  a 1 in  the 

rightmost  column  representing  a shorted  node.  (At  most  two  columns  of 

R„_  are  involved  in  the  short  circuit.  If  only  one  is  involved,  then 
T 

that  column  of  u is  unity.  If  two  columns  of  are  involved,  then 
only  the  rightmost  one  is  unity.  This  convention  could  be  reversed  by 
changing  the  definition  of  R) . We  now  decompose  £ into  U and  P 

S - U + P (16) 

where  £ is  formed  as  follows.  If  the  short  circuit  is  between  an 
accessible  and  an  inaccessible  node,  then  P * £.  If  the  short  circuit 
is  between  two  inaccessible  nodes,  then  f P)  is  unity,  and  the 

— 1 _ 1 “O 
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remaining  elements  are  zero.  (In  this  case,  the  unit  element  in  P 

is  element  , where  nodes  i and  j are  shorted,  and  i<j  due  to 

our  numbering  convention).-  With  these  definitions,  it  is  easy  to  see 

that  IJ  is  real  and  unitary,  since  its  columns  are  independent  unit  vectors 

and  thus  form  an  orthonormal  set  [7].  Consequently  U_1  = U,  and  if  the 

short  circuit  is  between  an  accessible  and  an  inaccessible  node,  then 

Ji.  * S.  If  the  short  circuit  is  between  two  inaccessible  nodes,  then 

jJ  is  only  slightly  more  difficult  to  compute,  as  we  will  show.  Now 
A T 

define  £ = SY^S  , and  therefore 


R„„ 

22 

r t*  i 

Y R i it 

—2  ! — 22— 2— 

T 

_u 

-2  [-22  | - 

.—  ^2—22  ! . 

and 


K 

: L ‘ 

M 

1 

' N . 

(17) 


Faddeeva  [8]  shows  that  K 


ir1 


[ .1-  0*22—2—)—]  ’ and  therefore 


(18) 


The  inverse  on  the  right  side  of  (18)  can  be  computed  efficiently 

using  a formula  given  by  Householder  [6,7],  which  states  that  if 
T 

A ■ B - vv  , where  v and  w are  column  vectors,  then 


+ 


-1  T 
B vw  B 


1 


1 

v 


(19) 
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where  the  division  on  the  right  side  of  (19)  is  by  a scalar.  Thus 
the  inverse  of  is 


K 


1 + 1 - ”<*22^ 


(20) 


In  order  to  use  (20) , K and  M must  be  found  from  (17) . We  have 

mentioned  that  if  the  short  circuit  is  between  an  accessible  node  and 

-1  T 

an  inaccessible  node,  then  “ S_.  In  fact,  if  the  short  circuit  is 

between  an  accessible  node  and  node  n,  then  = I_,  and  K and  M are 

simply  partitions  of  (17).  If  the  short  circuit  is  between  two  in- 

-1  T 

accessible  nodes,  we  have  shown  that  .S  = U + £,  where  U = U,  and 

T 

the  rank  of  P is  unity.  Thus  £ can  be  written  in  the  form  vw  , and 

(19)  can  be  used  to  find  Proceeding  in  this  manner  it  can  be  shown 

-1  T 

that  S_  * 1J  + where 

(-1,  if  r * i-p,  s = n-p 
0,  otherwise 

And  so,  even  in  the  most  complex  case,  IC  and  M can  be  evaluated  without  a 
matrix  inversion.  Some  time  can  be  saved  in  computing  the  triple  matrix 
product  in  (17)  by  noting  that  L and  N (each  column  vectors)  are  not  needed, 
so  the  last  column  of  £ need  not  be  evaluated.  Using  straightforward 

matrix  multiplication  to  find  K and  M from  (17)  and  then  Yl  ^ from  (20) 

2 4 3 2 L 

requires  q (q  - 2q  + 3q  -q  + 1)  complex  multiplications,  where  q is 

the  number  of  accessible  nodes  in  the  network. 

This  method  of  computing  * is  unnecessarily  inefficient  since 

many  of  the  matrices  in  (17)  and  (20)  are  sparse,  and  others  (j>  and  , 

in  addition,  are  special  since  their  non-zero  elements  are  either  +1  or  -1. 

In  Section  5 we  show  how  to  reduce  the  number  of  complex  multiplications 

-1  2 

required  to  evaluate  Y^  to  89  (q— 1) » where  $ is  typically  about  0.3  and 

is  always  a positive  number  less  than  unity. 

Once  Y^  * is  known,  the  y-parameters  at  the  accessible  nodes  are 

found  from  (8) . Evaluating  the  matrix  products  in  (8)  in  a straightforward 
3 2 2 

manner  requires  2b  + (2p  + q - l)b  + (q-1)  b multiplications,  where 
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b is  the  number  of  branches  in  the  network,  p is  the  number  of  accessible 
nodes  not  counting  the  reference  node,  and  q is  the  number  of  inaccessible 
nodes.  Using  the  sparse  matrix  techniques  described  in  Section  5,  the 


number  of  multiplications  needed  to  evaluate  ^ from  Y^  can  be  reduced 


,-l 


-2 


to  2(1  + y)b  , where  y is  typically  about  0.1.  For  example,  a network 
with  20  branches,  3 accessible  nodes  (plus  the  accessible  reference  node), 
and  8 inaccessible  nodes  requires  14180  multiplications  to  evaluate  ^ 
directly  from  (8).  By  using  sparse  matrix  techniques,  this  is  reduced  to 
880  multiplications,  a saving  of  a factor  of  16  in  this  case.  The  straight- 
forward evaluation  in  general  requires  nearly  b times  as  many  multiplications 
as  the  more  efficient  approach  using  sparse  matrix  techniques,  so  the  saving 
increases  with  network  complexity. 


4.  Open  Circuits  and  Other  Large  Parameter  Changes 

For  completeness,  we  conclude  this  paper  with  a discussion  of 


open  circuits  and  other  large  parameter  changes,  although  this  has 
been  discussed  elsewhere  [4].  The  main  purpose  in  this  section  is  to 
show  how  the  analysis  can  be  carried  out  in  the  context  of  nodal  analysis. 
Parameter  changes  are  accounted  for  in  (8)  by  a change  in  the 


bxb  branch  admittance  matrix  Y,  . 

b 


In  order  to  evaluate  the  new  y-parameters 


from  (8),  the  inverse  of  the  inaccessible  nodal  admittance  matrix 


Y„  * A„Y,  A„  must  be  found.  If  a single  parameter  in  the  network  is 
—2  — 2— b— 2 » 


changed,  then  the  new  branch  admittance  matrix  is  Yj*  ■ Y^  + _A,  where 


■=b  ~b 

has  onlv  one  non-zero  element  representing  the  change  in  one  parameter. 

T 


Suppose  a change  occurs  in  element  ij . Then  ^ can  be  written  duv  , where 
d is  a scalar  equal  to  the  parameter  change,  u is  a unit  vector  in 
direction  i,  and  v is  a unit  vector  in  direction  j.  Now  define 

Householder  shows  that 


U * A^JJ  and  V - A^v,  so  that  Y^  ■ Y^  + dUVT. 


the  inverse  of  Y^  can  be  found  from 


-1  -1  -1  T -1  -IT  -1 

Y ' - Y2  [1  - U(d  + V Y2  U)  V Y2  ] 


(21) 


-1  T -1 

The  inverse  of  Y2  is  known,  and  d + V Y 2 U is  a scalar,  so  (21)  can 


be  evaluated  without  a matrix  inversion. 
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(I 


1 


I 


5.  Sparse  Matrix  Techniques 

Sparse  matrices  arise  in  both  the  evaluation  of  the  inverse  if 
and  in  the  evaluation  of  the  accessible  y-parameter  matrix  These 

are  discussed  separately  in  Sections  5.1  and  5.2.  Section  5.3  summarizes 
the  total  saving  in  computational  complexity  and  shows  an  example. 


5. 1 Evaluating  the  Inverse  of  Yj 

We  have  shown  that  the  inverse  of  Y^  may  be  found  from  (20) , which 
is  repeated  below  for  convenience: 

l2_1  = + ^22-2-W1  " ^%2-2-))]  (21) 

where  K and  M are  defined  by 

T -1  -1  -1 

- (S1)  \LS  (22) 

The  dimensions  of  K and  M are  respectively  (q-l)x(q-l)  and  lx(q-l). 

In  (22)  Y^  is  known,  and  S^  is  defined  by 


K L 
M N 


u 


(23) 


R22  is  the  matrix  which  transforms  A^,  the  (inaccessible  node)-to- 
branch  incidence  matrix  before  the  fault,  into  A^,  the  (inaccessible  node)- 
to-branch  incidence  matrix  after  the  fault,  so  that 

A2  = A22— 2 (24) 

In  the  discussion  that  follows,  the  reader  may  find  it  helpful  to  refer 
to  the  illustration  of  Fig.  3.  Suppose  nodes  i and  3 are  shorted,  where 
j > i.  Then  rows  1 through  1-1  of  contain  zeros  except  that  the 
diagonal  element  is  unity.  This  indicates  that  nodes  1 through  i-1  in 
the  faulted  network  are  identical  to  nodes  1 through  i-1  in  the  unfaulted 
network.  Row  i of  contains  all  zeros  except  for  unity  elements  in 


■ " 


jr 


columns  i and  j , indicating  that  node  i in  the  faulted  network  is  formed 
by  combining  nodes  i and  j in  the  unfaulted  network.  Finally,  the  re- 
maining rows  of  R^2  contain  all  zeros,  except  that  the  first  row  to  the 
right  of  the  diagonal  is  unity.  This  indicates  that  the  remaining  nodes 
of  the  faulted  network  are  identical  to  the  remaining  nodes  of  the  un- 
faulted network.  The  faulted  network  contains  one  less  node  than  the 
unfaulted  network,  so  the  dimension  of  R^  is  (q-l)xq,  where,  as  usual, 
q is  the  number  of  inaccessible  nodes. 

We  have  chosen  to  number  the  nodes  consecutively,  beginning  with 
the  accessible  nodes.  We  have  also  avoided  analysing  networks  with  a 
short  circuit  between  two  accessible  nodes,  since  such  a fault  may  be 
found  from  driving  point  impedance  measurements  between  all  pairs  of 
accessible  nodes.  Thus  node  j is  always  inaccessible,  but  node  i may  be 
either  accessible  or  inaccessible.  In  the  description  above  we  have 
assumed  that  node  i is  inaccessible.  If  it  is  accessible,  then  is 
simpler  than  described  above,  since  all  rows  contain  a single  unit  element. 
(The  unit  element  is  on  the  diagonal  in*  rows  1 through  j-1  and  in  the  first 
column  to  the  right  of  the  diagonal  in  all  other  rows.  This  indicates  that 
all  inaccessible  nodes  in  the  faulted  network  correspond  to  an  inaccessible 
node  in  the  unfaulted  network,  but  one  inaccessible  node  in  the  unfaulted 

network  is  now  accessible,  and  so  it  has  no  row  in  R»-) • 

T ^ 

In  (23)  _u  is  a row  vector  containing  all  zeros  except  that*  column 

j is  unity. 

To  illustrate  the  formation  of  the  matrix,  consider  a network  with 

8 inaccessible  nodes.  (For  convenience  in  this  discussion  they  are  numbered 

1 through  8,  instead  of  p+1  through  p+8) . Figure  4(a)  shows  the  matrix 

for  a short  circuit  between  nodes  4 and  6,  and  Fig.  4(b)  shows  the  j> 

matrix  for  a short  circuit  between  node  6 and  any  accessible  node.  The 

-I  T 

formation  of  the  inverse  of  the  £ matrix  is  given  by  £ » U + (£,  where 


II 

i 


-1,  if  r = i-p,  s = n-p 
0,  otherwise 
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Fig.  4.  £ matrices  for  a network  with  8 inaccessible  nodes.  (a)  Short 

circuit  between  nodes  4 and  6.  (b)  Short  circuit  between  nodes  6 and 

any  accessible  node. 


as  explained  in  Section  3.2,  where  jL  ■ U + £,  and  all  elements  of  J?  are 
zero  except  if  the  shorted  nodes  i and  j are  both  inaccessible,  in  which 
case  [Ply  is  unity.  The  unity  element  of  £ is  circled  in  Fig.  4(a). 

In  this  illustration  we  have  taken  p-0  for  convenience,  where  p is  the 
number  of  accessible  nodes.  Also  note  that  n-p  • q.  The  inverse  of  the 
S.  matrix  can  be  formed  by  inspection,  and  in  particular,  Fig.  5 shows  the 
inverses  of  the  matrices  of  Fig.  4. 
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Fig.  5.  (a)  Inverse  of  the  matrix  of  Fig.  4(a).  (b)  Inverse  of  the  matrix 

of  Fig.  4(b) 


We  now  turn  to  the  problem  of  evaluating  K and  M.  The  right  side  of 
(22)  may  be  written 

(u  + £t)y:i(ut  + £)  = uY-y  + uy'1^  + ivy  + (23> 


Recall  that  the  rightmost  column  (column  q)  of  (23)  is  not  needed. 
The  second  and  fourth  terms  on  the  right  side  of  (23)  are  zero  except  for 
the  rightmost  column,  so  they  need  not  be  evaluated. 

To  evaluate  UY„  IJ  , define  the  row  vector  V - [v^] , where  v^  * 

(unit  column  of  row  k in  D),  k ■ 1,  2,  •**,  q-1.  Thus 

&L  - 2^v  ,,  • (m  ■ 2-  O-1)  <24> 

“ ° V”  - 2-  <fV 

Finally  M is  the  lx(q-l)  row  vector  whose  elements  are  the  first  q-1 
columns  of  the  last  row  (row  q)  of  the  sum  of  terms  one  and  three  of  (23) . 


[m]  - cuy"1/]  + 

— n L — q ,n  / — q ,n 

“ Cy:1].  v - [Yl1]  , (n  - 1,  2,  •••,  q-1)  (25) 

n ’ n 

K and  M are  evaluated  from  (24)  and  (25)  with  no  multiplications,  and  the 

evaluation  of  K,  the  larger  of  the  two,  requires  no  additions. 

Now  that  K and  M are  known,  Y^  * can  be  found  from  (21).  In 

evaluating  (21)  it  is  useful  to  first  find  the  (q-l)xl  column  vector 

x - JUjYjU.  Recall  that  u Is  a vector  with  all  zero  elements  except  that 

row  j is  unity.  Thus  x is  the  j column  of  ^22X2’  R22  t'ie  Partf-t:fon 

of  £ formed  from  the  first  q-1  rows  (23).  We  may  make  use  of  the  row 
T 

vector  V to  form  x as  follows: 


Wn  ■ [R22^2^n 


[Y.l  . , if  node  i is  accessible 

-2  vJ 


• vn  * 1 

C-2]vn,J  + ’ vn 


, node  i is 
inaccessible 


(n  = 1,  2,  •••,  q-1) 
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Using  the  definition  of  x in  (26),  (21)  may  be  written 


Y^"1  “ K + K[jdl/  ( 1 - toe)  ] 


(27) 


which  requires  evaluating  three  matrix  products,  two  of  which  directly 
involve  x.  From  (26)  it  is  clear  that  x is  formed  by  reordering  the 
elements  of  column  j of  Since  only  a few  inaccessible  nodes  are 

normally  coupled  to  node  j,  x is  usually  sparse.  Define  8 equal  to  the 


fraction  of  non-zero  elements  of  column  j of  Y^«  The  number  of  multiplica- 


tions required  to  compute  Mx  is  6q  ^ q.  6 is  usually  significantly  less 
than  one,  and  so  it  is  worth  designing  the  algorithm  to  avoid  the  multi- 


plications by  zero.  Similarly,  6q  rows  of  xM  are  zero,  and  it  is  worth 

2 


forming  only  the  non-zero  rows.  Thus  (27)  can  be  evaluated  with  8q  (q-1) 
multiplications  and  divisions.  Note  that  the  multiplications  in  (27)  are 
the  only  multiplications  required  to  evaluate  Y£  since  K and  M have 
been  formed  in  (24)  and  (25)  without  multiplications. 


5.2  Evaluating  the  y-Parameters 

In  this  section  we  discuss  the  evaluation  of  the  y-parameter  matrix 
given  by  (8) . In  the  faulted  network,  the  node-to-brance  incidence 

so  that  (8)  becomes 


matrices  are  Aj  and  A^, 


2-i  * AjlfcCi  “ -2  -2  -2^-1 


(28) 


T T 

Two  forms  appear  in  (27)  which  must  be  discussed  separately:  A YA  and  AYA  . 


The  most  convenient  way  to  store  the  node-to-branch  incidence  matrices. 


which  we  will  refer  to  generally  by  A in  the  discussion  which  follows,  is  by 


two  column  vectors  A and  A , defined  as  follows.  If  the  k 

A 


th 


has  a +1  element,  then  [A+ 


column  of  A 
th 


(row  number  of  the  +1  element  of  the  k 

column  of  A),  otherwise  [A  ] 

. **•  t 

a -1  element,  then  [A  ],  * (row  number  of  the  -1  element  of  the  k column 


of  A) , otherwise  [A 


0.  The  formation  of  A and  A corresponding  to  a 


given  node-to-branch  incidence  matrix  A is  illustrated  in  Fig.  6.  Note 

.+ 


that  in  practice  it  is  necessary  to  store  only  the  vectors  A and  A and 
not  the  matrix  A. 
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Fig.  6.  Node-to-branch  incedence  matrix  A,  and  the  corresponding  compact 
equivalent  vectors  A+  and  A . 


(29) 


Now  consider  the  evaluation  of  X = A YA.  Element  [X]^  n is  given  by 

- l [AT].,k  l H^'Ca]^ 

■ [1][a+L.[a+L  + ^[A'^.CA-^ 

' Cl][A+]„.CA'L  ' [I][A':m,[A+], 


— m — n 


If  any  of  the  subscripts  of  [Y)  in  the  last  expression  is  zero,  the  term 
is  zero. 

T 

The  evaluation  of  X'  * AYA  is  somewhat  different  because  the  sum- 
mation is  along  the  rows  of  A Instead  of  down  the  columns.  In  particular, 

element  [ X* ] is  given  by 

m,n 
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(30) 


tx’]m>n  - K*]„>k  Ki:k>1-[AT]1>n 

• kHA:a>k-tA]njl-cx]kil 


The  right  side  of  (30)  cannot  be  evaluated  by  choosing  m and  n as  in  (29). 
Instead  k and  1 are  chosen  and,  for  each  choice,  [y]^  ^ is  added  to  ele- 
ments [X  ] [a+]i  an<*  [a  ],_,  [A  ],  and  subtracted  from  elements 


fvl  I A. 

— J[A  ]^»  [A  and  [X'  ][a“]  [a+]  * ^ any  t^ie  subscripts  of  [x' ] is 

zero,  that  term  is  zero. 

Products  of  the  forms  X'=<  AYa'*’  and  X = A^YA  both  appear  in  (28),  and 
we  have  shown  how  to  evaluate  them  without  multiplication.  An  additional 
product  of  the  form  X' *Y^  must  be  evaluated,  where  Y^  is  given  by  (3).  Y^ 
is  typically  very  sparse,  so  in  the  software  we  have  chosen  to  store  it  as 
three  vectors:  one  containing  the  non-zero  elements  of  Y^>  and  the  other 
two  containing  respectively  the  row  and  column  of  that  element.  The  saving 
in  storage  can  be  appreciated  by  recognizing  that  Y^  is  a bxb  matrix,  where 
b is  the  number  of  branches  in  the  network.  The  elements  of  Y,  which  are 

“O 

not  zero  are  the  b diagonal  elements  plus  one  additional  element  for  each 

controlled  source.  If  the  number  of  controlled  sources  is  b/10,  then  the 

sparse  storage  of  Y,  requires  storing  1.1b  complex  numbers  and  2.2b  integers, 
2 

compared  with  the  b complex  numbers  required  for  conventional  storage.  If 

the  software  is  written  to  accommodate  networks  with  100  branches,  then 

4 

conventional  storage  requires  setting  aside  room  for  10  complex  numbers, 
compared  to  about  120  complex  numbers  and  240  integers  required  by  compact 
storage. 

Considerable  computation  time  is  also  saved  by  storing  Y.  sparsely. 

3 “ 

The  matrix  product  X' in  (28)  would  require  b complex  multiplications 

if  performed  directly,  since  the  dimensions  of  both  JC'  and  Y,  are  bxb. 

2 ~ “ 

The  algorithm  of  Fig.  7 forms  the  product  in  (1  + y)b  complex  multiplica- 
tions, where  y is  the  fraction  of  the  b branches  containing  a controlled 
source.  Typically  y is  about  0.1.  In  Fig.  7,  YB(*)  is  a vector  containing 

the  IYB  non-zero  elements  of  Y,  , and  YBR0W(*)  and  YBC0L(*)  are  integer 

— b 

vectors  containing  the  row  and  column  numbers  of  each  element  of  YB(*). 
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The  product  X' • Y^  is  called  D(*,*)*  D(*,*)  is  not  sparse,  so  it  is  stored 

convent ionally . 


Fig.  7.  Efficient  formation  of  the  matrix  product  X* »Y. 


Examination  of  (28)  reveals  another  matrix  product  of  the  form  Y,  •£, 

T -1  ~ 0 

where  in  this  case  Z = 1 + A_Y'  AlY,  . This  matrix  product  is  formed  in  a 

— L — Z — L D 


similar  manner  to  X'  Y,  in  (1  + y)b2  complex  multiplications.  Thus  alto- 


gether 2(1  + y)t>  complex  multiplications  are  required  to  formulate  A 

direct  evaluation  of  (28)  requires  2b  ^ + (q-l+p)b^  + ((q-l)^  + p^)b  multi- 


plications. Taking  p“2,  b=20,  q*ll  we  find  that  ^ requires  22880  multi- 
plications by  direct  methods,  but  only  880  multiplications  by  the  methods 
described  in  this  section  with  y=  0.1,  a saving  of  a factor  of  26  in  this 
example. 

Before  closing  this  section  it  should  be  noted  that  the  y-parameters 
of  the  unfaulted  network  are  evaluated  before  evaluating  the  y-parameters 


of  any  faulted  networks.  This  is  done  by  applying  (28)  with  A.  replaced  by 

-1  -1  1 

Aj,  A^  replaced  by  A^,  and  Y^  replaced  by  Y^  • The  inverse  of  Y^  ®ust  be 


« 
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computed  by  conventional  methods,  but  the  methods  of  this  section  may  be 
used  for  the  remaining  computation. 

5.3  Summary  of  Efficient  Computation 

The  y-parameters  at  the  accessible  ports  of  the  faulted  network  are 

found  from  (28) , where  in  the  context  of  this  report  the  faulted  network 

is  formed  from  the  unfaulted  network  by  shorting  a single  pair  of  nodes. 

In  Section  5.1  we  showed  how  to  form  the  inverse  of  the  (q-l)x(q-l) 

2 

matrix  Y'  with  only  8q  (q-1)  complex  multiplications,  where  8 is  the 

^ T 

fraction  of  non-zero  elements  in  Y,..  (Recall  that  Y„  = A„Y.  A„  is  the 

—2  —2  —2—b—2 

inaccessible  nodal  admittance  matrix).  Inversion  of  a (q-l)x(q-l)  matrix 

3 

requires  about  5 (q-1)  complex  multiplications  using  Wilf's  rank  annihilation 
method^[ 11],  which  is  an  efficient  application  of  the  Householder  formula 
(19),  [5-7]. 

2 

In  Section  5.2  we  showed  that  (28)  can  be  evaluated  in  2(1  + y)b 

complex  multiplications.  This  result  applies,  whether  or  not  the  efficient 

evaluation  of  the  inverse  of  Y^  section  5.1  is  used.  Thus  the  total 

number  of  complex  multiplications  required  to  evaluate  the  y-parameters  at 

the  accessible  ports  of  the  network  with  one  pair  of  shorted  terminals  is 
2 2 

2(1  + y)b  + q (q-1).  If  the  inverse  of  Y^  is  found  by  conventional 

methods  but  the  sparse  matrix  techniqes  of  Section  5.2  are- used  to  form 
2 3 

y-f , then  2(1  + y)b  + 5(q  - 1)  complex  multiplications  are  required. 

The  y-parameters  resulting  from  all  possible  single  short  circuits  „ 
involving  at  least  one  inaccessible  node  must  be  evaluated.  If  the  network 
contains  p accessible  terminals  and  q inaccessible  terminals,  then  it  can 
be  shown  that  N » q(p  + q)  - q(q  - 1 ) / 2 distinct  pairs  of  shorted  terminals 
can  be  identified.  The  total  number  of  complex  multiplications  is  there- 
fore N[2(l  + y)b2  + Bq2(q  -1)]  using  the  efficient  method  in  inverting  Y’ 

2 3 ^ 

and  N[ 2(1  + y)b  + 5(q  - 1)  ] using  a conventional  inversion  method.  To 

show  the  benefit  of  the  efficient  matrix  inversion,  let  us  assume  that  the 

number  of  accessible  ports,  p,  is  2,  the  number  of  branches  b is  twice  the 

*An  error  appears  in  our  edition  of  Ralston  and  Wilf  [11].  In  the  flow  chart 
on  p.  75,  step  11  should  be  l-*-i,  not  2-*-i. 
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number  of  nodes,  p + q + 1,  the  fraction  y of  branches  having  controlled 
sources  is  0.1,  and  the  fraction  6 of  non-zero  elements  of  Y^  is  0.3. 

Using  these  assumptions,  the  total  number  of  complex  multiplications 
necessary  to  analyize  all  the  inaccessible  short  circuits  using  both  the 
efficient  and  the  conventional  methods  of  inverting  are  shown  in  Fig.  8. 
The  advantage  of  the  efficient  method  is  especially  noticable  for  the 
larger  networks. 


NUMBER  OR  INACCESSIBLE  NODES.  0 


Fig.  8.  Number  of  complex  multiplications  required  to  analyze  all 
inaccessible  short  circuits  in  a network  with  q inaccessible  nodes. 

h.  Status  of  Current  Software 

The  algorithms  described  above  have  been  implemented  in  Fortran  IV. 

Currently  two  main  programs  are  available  which  access  the  same  sub- 
^rifraa*.  Each  accept  as  data  a description  of  the  network,  including 
>•  .dent if ication  of  the  accessible  nodes.  MAINI  then  prints  the  y- 
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parameters  at  the  accessible  terminals  computed  from  the  nominal  network 
and  from  every  network  that  can  be  obtained  from  the  nominal  network  by 
shorting  pairs  of  terminals  in  which  at  least  one  terminal  in  the  pair  is 
inaccessible.  MAIN2  is  identical  to  MAIN1  except  that  in  addition  it 
requires  as  data  a set  of  y-parameters  measured  at  the  accessible  terminals. 
MAIN2  then  compairs  all  computed  y-parameters  to  the  measured  y-parameters, 
determines  the  closest  match,  and  takes  this  case  as  the  most  likely  short 
circuit. 

MAIN1  may  be  used  to  generate  test  data  for  MAIN2.  Used  in  this  way 
the  network  parameter  values  (resistances,  capacitances,  inductances,  and 
controlled  source  gains)  given  to  MAIN2  are  the  nominal  values,  but  those 
given  to  MAIN1  differ  from  nominal  by  whatever  amounts  are  desired  for  the 
test.  The  user  then  selects  one  set  of  y-parameters  from  the  printout  of 
MAIN1.  These  computed  y-parameters;  connreponding  to  a specific  short  cir- 
cuit with  off-nominal  element  values  in  addition,  are  then  used  as  "measured" 
data  for  MA1N2.  In  this  way  it  is  possible  to  test  the  "robustness"  of  the 
method  of  short  circuit  isolation;  i.e.  to  test  its  ability  to  find  the 
correct  short  circuit  in  the  presense  of  "noise".  This  is,  of  course, 
necessary  in  practice,  since  the  parameters  of  the  measured  network  will 
not  be  exactly  the  nominal  values. 

The  use  of  the  computer  programs  is  described  in  Section  6.1,  and 
some  test  results  are  given  in  Section  6.2. 

6.1  Using  the  Software 

The  software  is  stored  on  disk  files  on  the  DEC10  system  in  Building 
620,  WPAFB,  Ohio.  The  files  are  accessed  from  user  number  [7777,137], 
passward  "ALFIE".  The  Fortran  (*.F0R)  files  are  stored  under  the  names 
MAIN 1 .FOR,  MAIN2 .FOR,  BL0CK4.F0R,  FAULT4 .FOR,  SNEW4 .FOR,  and  MINVC.FOR. 

To  create  a load  module  to  execute  MAIN1 , the  following  command  is  used: 

.LOAD  MAIN1.BL0CK4,  FAULT4 , SNEW4 , MINVC 

A load  module  to  execute  MAIN2  is  formed  in  exactly  the  same  way,  except 
that  "MAIN1"  is  replaced  by  "MAIN2".  (The  .SAVE  command  must  be  given 
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immediately  after  the  .LOAD  command,  or  the  load  module  will  not  be  stored 
on  the  disk) . 

In  the  interest  of  saving  disk  space,  all  relocatable  binary  (*.REL) 
files  have  been  deleted.  They  may  be  created,  if  desired,  by  the  .COM 
command.  For  example,  .COM  FAULT4.F0R  will  cause  the  FAULT4 . FOR  file  to 
be  compiled  by  the  Fortran  compiler  and  stored  under  the  name  FAULT4.REL. 

The  ?oad  modules  have  already  been  created  and  stored  under  the 
names  MAIN1.EXE  and  MAIN2.EXE.  To  execute  these  files  from  the  CRT  terminal, 
the  command  .RUN  MAIN I or  .RUN  MAIN2  is  used. 

After  the  command  .RUN  MAIN1  has  been  given,  the  following  prompt 
message  appears  on  the  screen: 


ENTER  IP,  N,  FREQ 


(31) 


The  dots  under  the  text  are  guides  marking  the  beginning  and  end  of  each 
of  the  three  data  fields.  (The  input  format  is  215,  E10.5).  The  first 
data  value  (IP)  is  an  integer  equal  to  the  number  of  accessible  nodes 
(not  counting  the  reference  node,  node  0).  The  second  data  value  (N)  is 
an  integer  equal  to  the  total  number  of  nodes  in  the  network  (not  counting 
the  reference  node) . The  third  data  value  (FREQ)  is  a real  number  equal 
to  the  frequency  in  Hz  at  which  the  analysis  is  to  be  preformed. 

After  entering  these  three  numbers  (and  pressing  the  "return"  key), 
the  following  prompt  message  appears  on  the  screen: 


ENTER  NB,  Nl,  N2,  ITYPE,  VALUE 


(32) 


where  again  the  dots  are  guides  to  the  beginning  and  end  of  each  data  field. 
(In  this  case  the  input  format  is  415,  E10.5).  This  message  prompts  the 
user  to  enter  the  description  of  a network  branch  or  controlled  source. 

In  order  to  understand  how  to  enter  this  data,  some  conventions  must  be 
explained.  Each  network  branch  contains  a single  passive  element  (resistor, 
capacitor,  or  inductor).  The  controlled  sources  are  associated  with 
passive  branch  elements  and  are  not  counted  as  branches.  Branches  are 
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numbered  consecutively,  beginning  with  1.  Otherwise  the  branch  numbering 
is  arbitrary,  and  the  branch  data  need  not  be  ent  ered  in  same  order  as 
the  branch  numbers.  Each  branch  is  connected  between  two  nodes.  The  nodes 
are  numbered  consecutively  beginning  with  0.  By  the  convention  we  have 
chosen,  node  0 is  the  reference  node  for  the  node  voltages.  The  accessible 
nodes  are  numbered  1,  2,  IP.  The  software  is  dimensioned  to  accept 

at  most  3 accessible  nodes  (IP  <_  3) , 17  inaccessible  nodes  (N  <_  20)  , and 
50  branches.  In  entering  data  for  a passive  branch,  and  referring  to  (32), 
the  first  data  value  (NB)  is  an  integer  equal  to  the  branch  number,  the 
second  and  third  data  values  (N1  and  N2)  are  integers  equal  to  the  node 
numbers  at  the  two  ends  of  the  branch,  the  fourth  data  value  (ITYPE)  is 
an  integer  indicating  the  type  of  passive  element  (1  = resistor,  2 = in- 
ductor, 3 * capacitor),  and  the  fifth  data  value  (VALUE)  is  the  element 
value  (in  ohms,  henries,  or  farads,  as  appropriate).  The  order  in  which 
the  two  node  numbers  (N1  and  N2)  are  entered  establish  the  branch  reference 
direction,  as  shown  in  Fig.  9.  This  is  important  only  if  a controlled 


Fig.  9.  Reference  directions  for  the  branch  current  and  branch  voltage 

v,  for  branch  k. 
k 

source  is  coupled  to  or  from  the  branch.  If  no  controlled  source  is 
coupled  to  or  from  the  branch,  N1  may  be  either  node  number  and  N2  the 
other.  Controlled  source  data  will  be  discussed  in  the  next  paragraph. 

The  prompt  message  (32)  appears  on  the  screen  each  time  the  return  key 
is  pressed.  After  all  branch  data  in  entered  (including  controlled  source 
data),  enter  99999  followed  by  a "return".  The  prompt  message  "NETWORK 
INPUT  COMPLETE"  will  then  appear  on  the  screen  to  reassure  the  user  that 
all  is  well  so  far. 
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Controlled  source  data  Is  entered  under  the  same  prompt  message  (32) 
as  the  passive  branch  data  and  may  be  mixed  In  with  the  passive  branch 
data  or  entered  last,  as  the  user  prefers.  In  entering  controlled  source 
data,  the  first  data  value  entered  (NB)  is  a five  digit  Inter  beginning 
with  4,  such  as  40001.  The  same  number  may  be  used  for  all  controlled 
sources.  The  second  and  third  data  values  (N1  and  N2)  are  the  branch 
numbers  of  the  "to"  and  "from"  branches  respectively.  The  "to"  branch  is 
the  branch  containing  the  source,  and  the  "from"  branch  is  the  branch 
which  controls  the  source.  Fig.  10  shows  the  conventions  for  the  four 
types  of  controlled  sources  (1  = transconductance,  2 * transresistance, 

3 = current  gain,  4 = voltage  gain) . The  fourth  data  value  entered  is 
an  integer  equal  to  the  type  of  controlled  source,  and  the  fifth  data 
value  entered  is  a real  number  equal  to  the  gain  value  (in  mhos  for  type 
1 sources,  ohms  for  type  2 sources,  or  dimensionless  for  type  3 and  4 
sources) . 


conventions. 
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This  completes  the  data  requested  if  MAIN1  is  executed.  If  MAIN2 
is  executed,  the  prompt  message  below  appears  after  "NETWORK  INPUT  COM- 
PLETE" : 

ENTER  YN(1,1)  (33) 

• • • • 

The  dots  below  the  text  indicate  the  beginning  the  end  of  the  two  data 
fields,  which  in  this  case  are  the  real  and  imaginary  parts  respectively 
of  the  measured  y-parameter  in  mhos.  The  number  of  measured  y-parameters 
requested  depends  on  the  number  of  accessible  ports  (1  if  one  port  is 
accessible,  4 if  two  ports  are  accessible,  and  9 if  three  ports  are  acces- 
sible) . 

The  major  output  from  the  program  appears  on  the  line  printer. 

An  example  illustrating  the  use  of  the  program  is  given  in  Section 

6.2. 


6.2  Example  of  Software  Use 

Suppose  we  wish  to  analyse  the  network  shown  in  Fig.  11.  This 
network  contains  20  branches,  which  are  assigned  the  branch  numbers 
shown  in  parentheses  beside  the  branch  reference  direction  (1,  2,  ••*,  20 
in  arbitrary  order).  The  reference  node  is  numbered  0,  and  the  remaining 
11  nodes  are  numbered  1,  2,  ••*,  11,  beginning  with  the  accessible  nodes 
(nodes  1,  2,  and  3).  Very  simple  models  have  been  chosen  for  the  two 
transistors  in  this  example,  but  models  of  greater  complexity  may  be  used 
if  desired.  The  data  in  the  order  in  which  it  is  entered  is  given  in  Fig. 12, 
and  Fig.  13  shows  some  sample  output  from  the  line  printer.  The  analysis 
is  performed  at  1000  Hz,  as  seen  from  the  first  line  of  data. 
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Fig.  13.  Sample  output  from  MAIN1,  using  data  of  Fig.  12. 
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6.3  Effectiveness  of  Software  in  Locating  Short  Circuit  Faults 

The  network  of  Fig.  11  was  used  to  test  the  effectiveness  of  the 
software  (MAIN2)  in  locating  a single  short  circuit.  The  nominal  ele- 
ment values  for  ’-.he  network  were  entered  in  addition  to  the  9 complex 
y-parameters  at  the  three  available  ports  (nodes  1-0,  2-0,  and  3-0) 
"measured"  from  the  faulty  network.  The  "measured"  data  was  generated 
from  MAIN1  using  element  values  set  from  10%  to  20%  above  or  below  the 
nominal  values.  Table  1 shows  the  actual  fault,  the  fault  determined  by 
the  minimum  cost  function,  and  the  cost  function  of  the  actual  fault. 

The  cost  function  used  was 


y iliLlZiL 

i.J  I y±j I 


(34) 


where  y^  is  the  y-parameter  computed  by  MAIN2  and  y is  the  measured 
y-parameter  (in  this  case  computed  from  MAIN1) . Of  the  seven  examples 
at  1000  Hz,  the  program  correctly  located  five  of  the  faults.  The  two 
errors  seemed  to  be  caused  by  the  fact  that  the  magnitude  of  the  capacitor 
impedances  at  1000  Hz  were  much  lower  than  the  impedance  levels  of  the 
network  resistances.  (The  magnitude  of  the  impedance  of  a 10  pF  capacitor 

Table  1 . Summary  of  Test  Results 


FREQUENCY 

ACTUAL 

SHORTED  NODES 

MINIMUM 

COST  FUNCTION 

HZ 

SHORTED  NODES 

OF  MINIMUM 

COST  FUNCTION 

OF  CORRECT 

COST  FUNCTION 

FAULT 

at  1000  Hz  is  15.9  0).  The  test  was  rerun  at  10  Hz  for  the  two  incor- 
rectly Identified  cases,  and  the  faults  were  still  not  correctly  iden- 
tified. It  was  expected  that  the  short  circuit  between  nodes  6 and  7 
would  be  difficult  to  identify,  since  the  50  yF  bypass  capacitor  is  only 
about  3 ft  at  1000  Hz  , and  so  the  changes  in  the  parameters  of  the  mea- 
sured circuit  could  be  excpeted  to  mask  the  short  circuit.  In  fact,  one 
of  the  reasons  for  running  the  tests  was  to  see  how  much  the  network  para- 
meters could  be  changed  before  this  change  prevented  reliable  identifi- 
cation of  the  short  circuit.  Variations  of  10%  to  20%  still  permitted 
correct  identification  in  5 out  of  7,  or  71%  of  the  cases  tried.  If  the 
parameters  of  the  "measured"  network  were  set  to  the  nominal  values, 
then  correct  identification  would  occur,  of  course,  in  100%  of  the  cases, 
since  then  no  error  would  exist  between  the  "measured"  y-parameters  and 
those  generated  by  MAIN2  for  the  correct  fault.  A sequence  of  tests 
should  be  conducted  with  increasing  amounts  of  off-nominal  parameter 
variation.  This  has  not  yet  been  done  due  to  time  limitations. 
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III  CONCLUSIONS 

We  have  shown  an  efficient  way  to  analyse  networks  containing  a 
single  short  circuit  or  other  larger  parameter  change.  The  efficiency 
results  from  the  fact  that  the  network  change  caused  by  either  the  short 
circuit  or  other  large  parameter  change  results  in  an  alteration  of  the 
inaccessible  nodal  admittance  matrix  by  a matrix  of  unit  rank.  The  re- 
sponse of  the  faulted  network  can  therefore  be  computed  in  terms  of  the 
nominal  response  with  many  fewer  multiplications  than  are  required  for 
and  original  analysis.  Sparse  matrix  techniques  were  also  used  to  re- 
duce execution  time. 

The  efficiency  of  the  algorithm  makes  it  practical  to  exhaustively 
search  all  possible  single  short  circuits  involving  an  inaccessible  node 
in  order  to  find  a solution  to  the  problem  of  locating  a short  circuit  in 
an  analog  network  from  measurements  at  the  accessible  terminals.  The  test 
examples  described  in  Section  6.3  show  that  short  circuits  can  be  reliably 
located  by  the  algorithm,  even  though  measurements  were  made  at  only  one 
frequency  and  differences  between  the  actual  and  the  nominal  parameter 
values  were  not  taken  into  account.  We  expect  that  the  reliability  can 
be  improved  by  making  measurements  at  more  than  one  frequency  and  by 
solving  for  the  parameter  values. 


IV  RECOMMENDATIONS 


The  algorithm  described  in  this  report  is  an  important  first  step 
in  the  development  of  a practical  algorithm  for  fault  analysis  in  analog 
circuits.  The  test  results  are  most  encouraging  and  justify  further  work 
to  imporve  the  reliability  of  the  method . 

Some  steps  which  should  be  taken  to  improve  the  reliability  of  the 
fault  isolation  algorithm  are  summarized  below. 

• Investigate  the  problem  of  determining  a set  of  frequencies  at 
which  independent  measurements  can  be  made.  The  more  independent  data 
that  is  available,  the  more  successful  the  vault  isolation *can  be  expec- 
ted to  be. 

•Investigate  the  efficacy  of  incorportating  parameter  (resistance, 
capacitance,  inductance,  controlled  source  gain)  changes  into  the  fault 
model  in  addition  to  short  circuits.  We  have  discussed  an  efficient  way 
to  recomputing  the  y-parameters  at  the  accessible  nodes  after  the  occur- 
rance  of  a single  large  parameter  change,  but  this  is  not  implemented  in 
the  present  algorithm.  The  investigation  should  determine  if  a multi- 
dimensional search  for  the  best  parameter  fit  improves  the  reliability  of 
the  algorithm,  and  if  it  can  be  done  fast  enough  to  justify  its  inclusion 
in  the  algorithm. 


APPENDIX  A 

COMPUTER  PROGRAM  LISTINGS 


The  listings  of  all  Fortran  computer  programs  described  or 
referred  to  in  this  report  are  listed  in  this  appendix.  These  programs 
are  stored  in  disk  files  under  user  number  [7777,137],  password  "ALFIE", 
on  the  DEC10  System  in  Building  620,  WPAFB,  Ohio.  The  programs  are 
listed  under  the  various  file  names  in  which  they  are  stored:  MAIN1, 
MAIN2,  BL0CK4,  ELEMNT,  OTHER4,  FAULT4 , SNEW,  and  MIN VC. 

Two  non-standard  statements  (INCLUDE  ' • ')  appear  in  the 
listings:  INCLUDE  '0THER4 . FOR'  and  INCLUDE  'ELEMNT. FOR' . These  state- 
ments cause  the  statements  in  files  0THER4 . FOR  or  ELEMNT. FOR  to  be  in- 
serted in  place  of  the  INCLUDE  statement.  The  INCLUDE  statements  have 
been  used  to  insert  COMMON  statements  in  the  appropriate  places. 
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BL0CK4 . FOR 


BLOCK  BATA 
INCLUDE  '0THER4.F0R' 

BATA  A2PLUSr A2MNUS/100*0/ 
BATA  AIPLUSf  A1MNUS/100K0/ 
END 


ELEMNT . FOR 

COMMON/ELEMNT/Y (65) r GM<15>r  RM(15>f  XMU(15>f  ALPHA<15)f 


YB<65>f  YROW (65 ) f GMR0W(15>f  GMC0L<15>f 

2 RMROU< 15) f RMC0L(15>f  XMUR0U(15)f  XMUC0LC15>f 

3 AROU( 15) f ACOL( 15) f YBR0W(65)f  YBC0L(65)f  IGMf 

4 IRMf  IXMUf  I A f IALPHAf  IYB 

INTEGER  YROUf  GMROWf  GMCOLf  RMROWf  RMCOLf  XMUROUf  XMUCOLf 
1 AROUf  ACOLf  YBROWf  ybcol 


COMPLEX  Yf  YB 


0THER4 . FOR 


1 

2 

3 


COMMON/OTHER/ AIF'LUS ( 50 ) f A1MNUS(50)f  A1PLSP(50)f 
Y2(17f17)f  Y2I < 17  f 17  > f Y2PI(16f16)f 
A2MNUS<50  ) f A2PLSF'  < 50 ) f A2MNSF'(50)f 
Nf  IBf  ITOTf  YN(3f3> 

INTEGER  A2PLUS f A2MNUSf  A2PLSPf  A2MNSP 
INTEGER  AIPLUSf  AIMNUSf  AIFLSPf  A1MNSP 
COMPLEX  YNf  Y2f  Y2I f Y2PI 


A1MNSF’  ( 50 ) f 
A2FLUS  < 50 ) f 
If  Jf  IPf 


! 
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MAIN1 .FOR 


00020 
00040 
0 0060 
00080 
00085 
00090 
00095 
00098 
00100 
00200 
00210 
00230 
00250 
00270 
00300 
00330 
00350 
00370 
00 400 
00500 
00550 
00570 
00590 
00600 
00650 
00690 
00695 
00700 
00750 
00770 
00790 
00795 
00800 
00900 
01000 
01100 
01200 
01230 
01250 
01270 
01290 
01300 
01400 
01500 
01600 
01700 
01800 
01900 


C THIS  PROGRAM  COMPUTES  THE  V-PARAMETERS  OF  AH  ELECTRICAL 

C NETWORK  WITH  INTERNAL  SHORT  CIRCUITS.  AN  EFFICIENT 

C ALGORITHM  IS  USED.  SEE  REFERENCE! 

C 

C A.T.  JOHNSON.  JR. . 'EFFICIENT  FAULT  ANALYSIS  IN  ANALOG 

C CIRCUITS*.  FINAL  REPORT.  USAF-ASEE  SUMMER  FACULTY 

C RESEARCH  PROGRAM.  WPAFB,  OHIO.  AUGUST  1978 

C 

INCLUDE  'ELEMNT .FOR' 

INCLUDE  '0THER4.F0R' 

C 

C REAP  NETWLRK  DATA  AND  COMPUTE  NODE-TO-BRANCH  INCIDENCE 

C MATRICES  (SPARSE  STORAGE)  OF  ORIGINAL  NETWORK!  A1PLUS. 

C AiMNUS . A2PLUS.  A2MNUS 

CALL  INPUT 
C 

C COMPUTE  YB  MATRIX!  REF..  EQ.(3>. 

C 

CALL  YBMAT 
10  = N-IP 
C 

C COMPUTE  Y2  = A2  * YB  * A2T , REF.,  EQ.<2>. 

C 

CALL  Y2PMAT ( Y2 . IQ . 1 7 , A2PLUS , A2MNUS > 

C 

C COMPUTE  INVERSE  OF  Y2. 

C 

CALL  MINVC<  Y2,  Y2I  , IQ.  1 7> 

C 

C COMPUTE  Y-PARAMETERS  AT  ACCESSIBLE  TERMINALS. 

C REF.,  EQ . (8) . 

C 

CALL  YMAT ( A1PLUS, AIMNUS . A2PLUS . A2MNUS , Y2I , YN , IP , IB , 1 7 > 
WRITE'<3, 102) 

DO  15  M=1.IP 

15  WRITE <3, 100) < YN(M.NX) ,NX=1 .IP) 

NCASES  = IQ#N  - ( (IQ-1 >*I0>/2 
C 

C LOOP  THROUGH  ALL  PAIRS  OF  SHORT  CIRCUITS  INVOLVING 

C INACCESSIBLE  NODES.  I AND  J ARE  THE  SHORTED  NODES. 

C 

DO  10  K = IP+l.N 
WRITE (5.110)  NCASES 
110  FORMAT ( /5X, 13, ' CASES  LEFT') 

J-N+IP+l-K 
NCASES  = NCASES  - J 
DO  10  L=1 , J 
I = L-l 


■ 
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01920 

C 

01940 

c 

01960 

c 

01980 

c 

01985 

02000 

c 

02020 

c 

02040 

c 

02060 

c 

02080 

02100 

02200 

c 

02300 

02330 

c 

02350 

c 

02370 

02400 

c 

02500 

02600 

10 

02700 

100 

02800 

02900 

101 

03000 

03100 

102 

COMPUTE  NEW  NODE-TO-BRANCH  INCIDENCE  MATRICES  (A1PLSP. 
A1MNSP.  A2PLSP.  A2MNSP)  WITH  NODES  I AND  J SHORTED. 
REF.f  EQ. (20) . 

CALL  ANEW  ' 

COMPUTE  INVERSE  OF  Y2P  FOR  FAULTED  NETWORK. 

REF.»  EQ. (28) . 

CALL  SNEW 

CALL  THAT < A1 PLSP r A1MNSP r A2PLSP f A2MNSP , Y2P I.YN.IP.IB.16) 
WRITE ( 3> 101 ) I.J 

PRINT  Y-PARAMETERS. 

DO  10  M=1 . IF 

WRITE(3r 100)  <YN<M.NX> .NX=1.IP> 

STOP 

FORMAT ( 3(10X»  2E12.4.  ' J')  //) 

FORMAT ( ' 1 ' » 10X. ' Y-PARAMETERS  WITH  NODES' » 13. 

* ' AND'. 13. ' SHORTED'///) 

FORMAT (///»1 OX . 'NOMINAL  Y-PARAMETERS.  MHOSJ'/) 

END 
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MAIN2 . FOR 


00100 

00200 

00300 

00400 

00500 

00600 

00700 

00800 

00900 

01000 

01100 

01200 

01300 

01400 

01500 

01600 

01700 

01800 

01900 

02000 

02100 

02200 

02300 

02400 

02500 

02600 

02700 

02800 

02900 

03000 

03100 

03200 

03300 

03400 

03500 

03600 

03700 

03800 

03900 

04000 

04100 

04200 

04300 

04400 

04500 

04600 

04700 

04800 

04900 

05000 

05100 

05200 

05300 


THIS  PROGRAM  ISOLATES  SHORT  CIRCUIT  FAULTS  IN  ANALOG  NETWORKS. 
THE  FAULT  IS  ISOLATED  DY  COMPARING  COMPUTED  Y-PARAMETERS  AT 
THE  ACCESSIBLE  NODES  (FOR  THE  NETWORK  WITH  A PAIR  OF  NODES 
SHORTED)  TO  MEASURED  Y-PARAMETERS  AT  THE  SAME  ACCESSIBLE 
NODES.  FOR  A DESCRIPTION  OF  THE  EFFICIENT  ALGORITHM.  SEES 

A.T.  JOHNSON.  JR. , 'EFFICIENT  FAULT  ANALYSIS  IN  ANALOG 

CIRCUITS'.  FINAL  REPOSR.  USAF-ASEE  SUMMER  FACULTY  RESEARCH 
PROGRAM.  WPAFB.  OHIO.  AUGUST  1978 

INCLUDE  'ELEMNT .FOR' 

INCLUDE  ' 0THER4.FQR' 

DIMENSION  DEN0M<3»3> 

COMPLEX  YNN0M(3.3> 

READ  NETWORK  DATA  AND  COMPUTE  NODE-TO-BRANCH  INCIDENCE  MATRICES 
(SPARSE  STORAGE)  OF  ORIGINAL  NETWORK.  A1PLUS.  A1MNUS.  A2PLUS. 
A2MNUS. 

CALL  INPUT 

READ  MEASURED  Y-PARAMETERS?  YNNOM 

DO  5 K=1 .IP 
DO  5 L=1 » IP 
WRITE(5. 106)  K.L 
WRITE (5.110) 

READ(5> 108)  YNNOM (K.L) 

DENOM(K.L)  = CABS ( YNNOM (K.L)) 

IF (DENOM(K.L)  .£Q.  0.)  DENOM(K.L)  = l.E-9 
WRITE (3. 109) 

DO  1 M=1 > IP 

WRITE (3. 100) ( YNNOM (M» NX) »NX=1 . IP) 

IQ=  N-IP 

COMPUTE  YB  MATRIX.  SEE  REF..  EQ.(3>. 

CALL  YBMAT 

COMPUTE  Y2  = A2  * YB  * A2T • SEE  REF. » EQ.(2>. 

CALL  Y2PMAT ( Y2. IQ, 17. A2PLUS . A2MNUS > 

COMPUTE  INVERSE  OF  Y2. 

CALL  MINVC(Y2.Y2I , IQ. 17) 

COMPUTE  Y-PARAMETERS  AT  ACCESSIBLE  TERMINALS.  REF.,  EQ.(8). 

CALL  YMAT ( A1PLUS, A1MNUS. A2PLUS » A2MNUS , Y2I.YN.IP.IB.17) 

I SAVE  = 0 
JSAVE  ■=  0 
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05400 

C 
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FRO*  COPY  rURHISHED  TO  DOQ  ■ 

05500 

C 

PRINT  Y-PARAMETERS  OF  UNFAULTED  NETWORK  WITH  NOMINAL 

05600 

C 

PARAMETER  VALUES. 

05700 

c 

05800 

WRITE (3*102) 

05900 

DO  2 M=1 ? IP 

06000 

WRITE  < 3t 100 ) < YN<  M?  NX) t NX=1 1 IP ) 

06100 

c 

- 

06200 

c 

COMPUTE  COST  FUNCTION  OF  UNFAULTED  NETWORK.  CALL  IT  COSTMM. 

06300 

c 

06400 

COSTMN  = 0. 

06500 

DO  3 11=1? IP 

06600 

DO  3 Jl=ltIP 

06700 

3 

COSTMN  = COSTMN  + CABS< YN< 11 ? J1 > - YNNOM< 1 1 r J1 ) >/DENOM( I 1 ? J1 ) 

06800 

WRITE ( 3 1 107)  COSTMN 

06900 

NCASES  = IG*N  - <<IG-l>*I0>/2 

07000 

C 

07100 

c 

LOOP  THROUGH  ALL  PAIRS  OF  SHORT  CIRCUITS  INVOLVING  AN 

07200 

c 

. INACCESSIBLE  NODE.  I AND  J ARE  THE  SHORTED  NODES. 

07300 

c 

07400 

DO  10  K = IP+l.N 

07500 

WRITE (Still)  NCASES 

07600 

111 

FORMAT ( /5X 1 13 1 ' CASES  LEFT') 

07700 

J=N+IP+1-K 

07800 

NCASES  = NCASES  - J 

07900 

DO  10  L=1 1 J 

08000 

I = L-l 

08100 

c 

08200 

c 

COMPUTE  NEW  NODE-TO-BRANCH  INCIDENCE  MATRICES  (AlPLSPt  A1MNSP? 

08300 

c 

A2PLSP.  A2MNSP)  WITH  NODES  I AND  J SHORTED.  SEE  REF.t  EQ.  <13 

08400 

c 

08500 

CALL  ANEW 

08600 

c 

08700 

c 

COMPUTE  INVERSE  OF  Y2P  FOR  FAULTED  NETWORK.  SEE  REF.t  EQ.(IO) 

08800 

c 

08900 

CALL  SNEW 

09000 

c 

09100 

c 

COMPUTE  Y-F'ARAMETERS  AT  ACCESSIBLE  TERMINALS  OF  FAULTED 

09200 

c 

NETWORK.  SEE  REF.t  E0.<28>. 

09300 

c 

09400 

CALL  YMAT( AlPLSP. A1MNSP?A2PLSP?A2MNSP? Y2PI i YNt IP? IB? 16) 

09500 

c 

09600 

c 

COMPUTE  COST  FUNCTION  OF  CURRENT  Y-PARAMETERS  (NODES  I AND  J 

09700 

c 

SHORTED) . 

09800 

c 

09900 

COST  = 0. 

10000 

DO  20  11  = 1?  IP- 

10100 

DO  20  J1=1?IP 

10200 

20 

COST  = COST  + CABS<  YN( 1 1 ? J1 ) - YNNOM< 1 1 ? J1 > )/DENOM< 1 1 t J1  ) 

10300 

IF(COST  - COSTMN)  22?21t21 

10400 

22 

COSTMN  = COST 

10500 

ISAVE  = I 

10600 

JSAVE  = J 

I 
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10700  C 

10800  C . PRINT  NODES  I AND  J AND  THE  COMPUTED  Y-PARAMETERS  AT  THE 

10900  C . ACCESSIBLE  TERMINALS. 

11000  C 

11100  21  WRITE ! 3 , 101 ) I.J 

11200  DO  30  M=1  , IF' 

11300  30  WRITE!3» 100)  < YN(M.NX) ,NX=1 , IP) 

11400  10  WRITE! 3. 107)  COST 

11500  C 

11600  C PRINT  NODES  OF  LOWEST  COST  FAULT  AND  COST  FUNCTION  FOR 

11700  C THIS  PAIR  OF  SHORTED  NODES. 

11800  C 

11900  WRITE ( 3 > 103 ) ISAVE.  JSAVE,  COSTMN 

12000  STOP 

12100  100  FORMAT  (3 ( 10X, 2E12 . 4 » ' J')//) 

12200  101  FORMAT! ' 1 ' t 10X. 'Y-PARAMETERS  WITH  NODES', 13, 

12300  * ' AND' » 13.  ' SHORTED '///> 

12400  102  FORMAT!///. 10X» 'BEGIN  SEARCH  USING  NOMINAL  NETWORK.'///. 

12500  * 10X.  'Y-PARAMETERS  WITH  NO  SHORT  CIRCUITS.  MHOSS'/) 

12600  103  FORMAT! '1' ,10X. 'MOST  LIKELY  SINGLE  SHORTS '//l IX. 'NODES' . 

12700  * 13.'  AND'. 14.'  SHORTED. '////10X. ' !COST  FUNCTION  ='. 

12800  * Ell. 4.')'  ) 

12900  106  FORMAT!///, 5X, 'ENTER  YN< '. II .11 .')  ' ) 

13000  107  FORMAT!//, 56X, 'COST  =',  Ell. 3) 

13100  108  FORMAT !E10.4,1X,E10.4) 

13200  109  FORMAT!///, 10X, 'MEASURED  Y-PARAMETERS,  MHOS.'/) 

13300  110  FORMAT!'  . . . .') 

13400  200  FORMAT !8!F8.2»F6.2» ' J')> 

13500  END 


FAULT4 . FOR 


IHIS  PAGES  IS  BEST  QUALITY  FRACIICA&H 
FRO*  OOPY  TUEfflUSHED  TO  OOQ  ■ 


00100 

00200 

C 

00300 

C 

00400 

C 

00500 

C 

00600 

C 

00700 

C 

00800 

C 

00900 

C 

01000 

C 

01100 

C 

01200 

C 

01300 

C 

01400 

C 

01500 

c 

01600 

c 

01700 

c 

01800 

c 

01900 

c 

02000 

c 

02100 

c 

02200 

c 

02300 

c 

02400 

c 

02500 

c 

02600 

c 

02700 

02800 

02900 

03000 

103 

03100 

03200 

108 

03300 

03400 

03500 

03600 

03700 

03800 

03900 

04000 

04100 

1 

04200 

104 

04300 

04400 

109 

04500 

04600 

04700 

04800 

04900 

10 

05000 

05100 

05200 

16 

05300 

05400 

17 

05500 

15 

05600 

05700 

19 

05800 

05900 

30 

06000 

18 

SUBROUTINE  INPUT 

THIS  SUBROUTINE  READS  THE  NETWORK  DATA  AND  STORES  THE  NODE- 
TO-BRANCH  INCIDENCE  MATRICES. 

VARIABLE  -DEFINITIONS? 

IB  - NUMBER  OF  BRANCHES  IN  NETWORK  (PASSIVE  ELEMENTS) 

IGM  - NUMBER  OF  TRANSCONDUCTANCES 

IRM  - NUMBER  OF  TRANSRESISTANCES 

I YMU  - NUMBER  OF  VOLTAGE  GAINS 

I ALPHA  - NUMBER  OF  CURRENT  GAINS 

N - NUMBER  OF  NODES  (NOT  COUNTING  REFERENCE  NODE) 

IP  - NUMBER  OF  ACCESSIBLE  NODES  ( * ) 

FREQ  - FREQUENCY  AT  WHICH  ANALYSIS  IS  PERFORMED  (HZ) 

Y - MATRIX  OF  PASSIVE  ELEMENT  ADMITTANCES  (SPARSE) 

GM  - MATRIX  OF  TRANSCONDUCTANCES  (SPARSE) 

RM  - MATRIX  OF  TRANSRES I ST ANCCS  (SPARSE) 

XMU  - MATRIX  OF  VOLTAGE  GAINS  (SPARSE) 

ALPHA  - MATRIX  OF  CURRENT  GAINS  (SPARSE) 

YROW ( . > VECTOR  STORING  ROW  OF  Y(.) 

GMROU(.)  VECTOR  STORING  ROW  OF  GM(.) 

GMCOL ( • ) VECTOR  STORING  COLUMN  OF  GM( . > 


SIMILARLY  FOR  RMROW . RMCOL.  XMUROW.  XMUCOL.  AROW.  ACOL. 

INCLUDE  '0THER4.F0R' 

INCLUDE  'ELEMNT .FOR' 

WRITE(5. 103) 

FORMAT ( 10(/>  » ' ENTER  IP.  N.  FREQ') 

WRITE (5.108) 

FORMAT ('  . ..  ..  .') 

READ(5.100>  IP. N. FREQ 

OMEGA  = 2.*3.141593*FREQ 

IB  = 0 

IGM  4=  0 

IRM  = 0 

IXMU  = 0 

I ALPHA  = 0 

URITE(3. 102)  N.  IP.  FREQ 
WRITE(5. 104) 

FORMAT (5(/) > ' ENTER  NB.  Nl,  N2.  I TYPE.  VALUE' > 

WRITE(5» 109) 

FORMAT  (' .') 

REAIK5. 101 >NB.N1 »N2. I TYPE. VALUE 
IF ( NB  .EQ.  99999)  GO  TO  200 
WRITE (3. 106)NB.N1.N2.ITYPE. VALUE 
IF  (NB  - 10000)  10.20.20 
IB  = IB+1 

IF  (Nl.EQ.O)  GO  TO  15 
IF  (Nl-IP)  16.16.17 
AIPLUS(NB)  = Nl 
GO  TO  15 

A2PLUS ( NB > * Nl-IP 
IF (N2.EQ.0)  GO  TO  18 
IF(N2-IP>  19.19.30 
AIMNUS(NB)  = N2 
GO  TO  18 

A2MNUS(NB>  » N2-IP 
GO  TO  (11.12.13)  I TYPE 


06100  11  Y(IB>  = < 1. »0. ) /CMPLX( VALUE. 0. > 

06200  GO  TO  2 

06300  12  Y ( IB ) = (0.  »-l.  >/<OMEGA*VALUE> 

06400  ' GO  TO  2 

06500  13  Y< IB)  = <0. .1. >*OMEGA#VALUE 

06600  2 YROW( IB)  . = NB 

06700  GO  TO  1 

06800  20  GO  TO  (21 .22.23.24)  I TYPE 

06900  21  IGM  = IGM  + 1 

07000  GM(IGM)  = VALUE 

07100  GMROW(IGM)  = N1 

07200  GMCOL(IGM)  = N2 

07300  GO  TO  1 

07400  22  IRM  = IRM  + 1 

07500  RM< IRM ) = VALUE 

07600  RMROW ( IRM ) = N1 

07700  RMCOL(IRM)  = N2 

07800  GO  TO  1 

07900  23  I ALPHA  = I ALPHA  + 1 

08000  ALPHA (I ALPHA)  = VALUE 

08100  AROW  < I ALPHA ) = N1 

08200  ACOL< I ALPHA)  = N2 

08300  GO  TO  1 

08400  24  IXMU  = IXMU  + 1 

08500  XMU(IXMU)  = VALUE 

08600  XMUROW(IXMU)  = N1 

08700  XMUCOL(IXMU)  = N2  * 

08800  GO  TO  1 

08900  200  ITOT  = IGM  + IRM  + I ALPHA  + IXMU 

09000  WRITE (5 » 110) 

09100  WRITE (3  f 105)  IB.  ITOT 

09200  WRITE<3» 107)  IGM.  IRM.  I ALPHA . IXMU 

09300  RETURN 

09400  100  FORMAT  (2I5.E10.5) 

09500  101  FORMAT  (4I5.E10.5) 

09600  102  FORMAT ('l'.25X>' NETWORK  BATA! '//16X. 13. ' NODES' /16X . 13. ' ACCESSIBL 

09700  *E  N0DES'//16X. 'ANALYSIS  AT'.E12.4.'  HZ '. 10 (/>. 2X NETWORK  BATA  CAR 

09800  *DS'//> 

09900  105  FORMAT <//25X. 'NETWORK  INPUT  COMPLETE !'. 13 . ' BRANCHES.'. 

10000  1 14.'  CONTROLLED  SOURCES  INCLUDING! '//) 

10100  106  F0RMATC20X. 13.315. E16. 7) 

10200  107  FORMAT< 160. ' TRANSCONDUCTANCES ' /I60 . ' TRANSRESISTANCES'/ 

10300  1 160.'  CURRENT  GAINS ' /I60. ' VOLTAGE  GAINS') 

10400  110  FORMAT (//'  NETWORK  INPUT  COMPLETE') 

10500  END 


SUBROUTINE  YBMAT 


10600 

10700 

10800 

10900 

11000 

11100 

11200 

11300 

11400 

11500 

11600 

11700 

11800 

11900 

12000 

12100 

12200 

12300 

12400 

12500 

12600 

12700 

12800 

12900 

13000 

13100 

13200 

13300 

13400 

13500 

13600 

13700 

13800 

13900 

14000 

14100 

14200 

14300 

14400 

14500 

14600 

14700 

14800 

14900 

15000 

15100 

15200 

15300 

15400 

15500 

15600 

15700 


C . 

C THIS  SUBROUTINE  COMPUTES  MATRIX  YB.  SEE  REF . . EQ.<3) 

C 

INCLUDE  '0THER4.F0R' 

INCLUDE  'ELEMNT.FOR' 

INTEGER  XR0UC30).  XC0L<80>»  UR0W<80>.  UC0L<80> 

COMPLEX  X < 80 ) > W( 80  > 

C 

C FORM  X = -Y*XMU.  ALL  MATRICES  ARE  STORED  SPARSELY. 

C SINCE  Y IS  'DIAGONAL'.  COMPUTATION  IS  SIMPLIFIED. 

C 

IF(IXMU  .EQ.  0)  GO  TO  11 
DO  10  L = l.IXMU 
II  = XMUROU(L) 

XROW(L)  = II 
XCOL(L)  = XMUCOL(L) 

DO  9 K=1 .IB 

IF ( YROW(K)  .EQ.  ID  GO  TO  10 

9 CONTINUE 

10  X(L)  = -Y(K)*XMU(L) 

11  IX  = IXMU 
C 

-C  FORM  U = -Y*RM.  ALL  MATRICES  ARE  STORED  SPARSELY. 

C SINCE  Y IS  'DIAGONAL'.  COMPUTATION  IS  SIMPLIFIED. 

C 

IF( IRM  .EQ.  0>  GO  TO  13 
DO  12  L=1 . IRM 
II  = RMROW(L) 

UROW(L)  = II 
WCOL(L)  = RMCOL(L) 

DO  14  K=1 . IB 

IF ( YROW(K)  ^EQ.  ID  GO  TO  12 

14  CONTINUE 

12  U(L>  = -Y(K)*RM<L> 

13  IW  = IRM 
C 

C FORM  X = Y + X + GM.  ALL  MATRICES  ARE  STORED  SPARSELY. 

C SINCE  Y IS  'DIAGONAL'.  AND  X AND  GM  HAVE  NEITHER  DIAGONAL 

C ELEMENTS  NOR  ELEMENTS  IN  COMMON.  COMPUTATION  IS  SIMPLIFIED. 

C 

DO  20  L=1 » IB 
X< IX+L  ) = Y(L ) 

XROWUX+L)  = YROU(L) 

20  XCOL ( IX+L  > = YROU(L) 

IX  = IX  + IB 
IF ( IGM  .EQ.  0>  GO  TO  31 
DO  30  L-l . IGM 
X ( IX+L ) * GM(L) 

XROU( IX+L)  = GMROU(L) 

30  XCOL (IX+L)  = GMCOL(L) 

31  IX  = IX  + IGM 
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15800 

C 

15900 

c 

FORM  U = 1 + U + ALFHA.  ALL  MATRICES  ARE  STORED  SPARSELY 

16000 

c 

SINCE  U AND  ALPHA  HAVE  NO  COMMON  ELEMENTS  AND  NO  DIAGONAL 

16100 

c 

ELEMENTS.  COMPUTATION  IS  SIMPLIFIED. 

16200 

c 

16300 

DO  40  L=1 . IB 

16400 

w<  iw+l  > = u.,o.) 

16500 

WROU< IU+L)  = L 

16600 

40 

UCOL ( IU+L ) = L 

16700 

IW  = IU  + IB 

16800 

IF (IALPHA  .EQ.  0)  GO  TO  51 

16900 

DO  50  L= 1,1 ALPHA 

17000 

U< IU+L ) = -ALPHA! L> 

17100 

UROU ( IU+L > = AROU(L) 

17200 

50 

UCOL ( IU+L ) = ACOL(L) 

17300 

51 

IU  = IW  + IALPHA 

17400 

C 

17500 

C 

FORM  YB  = U*X.  ALL  MATRICES  ARE  STORED  SPARSELY. 

17600 

C 

COMPUTATION  IS  COMPLICATED  BY  THE  FACT  THAT  EXISTING 

17700 

c 

ELEMENTS  MUST  BE  SOUGHT  BEFORE  THE  MULTIPLICATION 

17800 

c 

CAN  BE  PERFORMED. 

17900 

c 

18000 

IYB  = 0 

18100 

DO  90  K=1 > IB 

18200 

DO  80  L=1,IU 

18300 

IF (WCOL(L)  - K>  80,55.80 

18400 

55 

DO  70  M=1 , IX 

18500 

IF (XROU(M>  - K)  70,65,70 

18600 

65 

IF ( IYB  .EO.  0)  GO  TO  67 

18700 

DO  60  Nl=l » IYB 

18300 

IF ( YBROWINl ) - UROU ( L ) ) 60,75,60 

18900 

75 

IF( YBCOL < N1 ) - XCOL(M)  ) 60,85,60 

19000 

60 

CONTINUE 

19100 

67 

IYB  = IYB  + 1 

19200 

YB(IYB)  = U(L>*X(M> 

19300 

YBROU(IYB)  = UROU(L) 

19400 

YBCOL (IYB)  = XCOL(M) 

19500 

GO  TO  70 

19600 

85 

YB(N1)  = YB(N1>  + U(L)*X(M> 

19700 

70 

CONTINUE 

19800 

80 

CONTINUE 

19900 

90 

CONTINUE 

20000 

RETURN 

20100 

END 
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20200 

SUBROUTINE  ANEW 

20300 

C 

20400 

C 

THIS  SUBROUTINE  COMPUTES  THE  NOBE-TO-BRANCH  INCIDENCE  MATRICES 

20500 

C 

(SPARSELY  STORED  VECTORS  A1PLSP,  A1MNSP. 

A2PLSP , A2MNSP ) FOR 

20600 

C 

THE  NETWORK  WITH  NODES  I AND  J SHORTED. 

SEE  REF.,  EQ. (13) . 

20700 

C 

20S00 

INCLUDE  '0THER4.F0R' 

20900 

DO  150  L=1 , IB 

21000 

AIPLSP(L)  = AIPLUS(L) 

21100 

AIMNSP(L)  = AIMNUS(L) 

21200 

IF< A2PLUS ( L ) - (J-IP>>  20.30.40 

21300 

20 

A2PLSP(L)  = A2PLUS(L> 

21400 

GO  TO  90 

21500 

30 

IF(I-IP)  50,50.60 

21600 

50 

IF ( I .EQ.  0)  GO  TO  55 

21700 

IF ( A1 MNSP(L  ) - I>  52,51,52 

21800 

51 

AIHNSP(L)  = 0 

21900 

GO  TO  55 

22000 

52 

AIPLSP(L)  = I 

22100 

55 

A2PLSP(L ) = 0 

22200 

GO  TO  90 

22300 

60 

IF< A2MNUS < L)  - <I-IP>>  80.70.80 

22400 

70 

A2PLSPCL)  = 0 

22500 

A2MNSP(L>  = 0 

22600 

GO  TO  90 

22700 

80 

A2PLSP  < L ) = I-IP 

22800 

GO  TO  90 

22900 

40 

A2PLSP(L>  = A2PLUS(L)  - 1 

23000 

90 

IF (A2MNUS(L)  - <J-IP>>  100,110,120 

23100 

100 

A2MNSP(L)  = A2MNUS ( L > 

23200 

GO  TO  150 

23300 

110 

IF ( I-IP>  130,130,135 

23400 

130 

IF ( I .EQ.  0)  GO  TO  134  . 

23500 

IF  ( AIF’LSP  < L ) - I)  132,131,132 

23600 

131 

AIPLSP(L)  = 0 

23700 

132 

AIMNSP(L)  = I 

23800 

134 

A2MNSP(L)  = 0 

23900 

GO  TO  150 

24000 

135 

IF (A2PLUS(L)  - <I-IP>)  145,140,145 

24100 

140 

A2PLSP  < L > = 0 

24200 

A2MNSF(L>  = 0 

24300 

GO  TO  150 

24400 

145 

A2MNSP(L>  = I-IP 

24500 

GO  TO  150 

24600 

120 

A2MNSP(L)  = A2MNUSCL)  - 1 

24700 

150 

CONTINUE 

24800 

RETURN 

24900 

END 
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'35000 

SUBROUTINE  YMAT(A1PLSP,A1MNS» .A2PLSP.A2MNSP.Y2PI ,YN, IFP.IB, IQT 

25100 

C 

25200 

C 

THIS  SUBROUTINE  COMPUTES  THE  Y -PARAMETERS  AT  THE  ACCESSIBLE 

25300 

C 

NOHES.  SEE  RCF  . , EQ.<8>.  SPARSE  MATRIX  TECHNIQUES  ARE 

25400 

C 

USED.  AS  DESCRIBED  IN  REF..  PP  10-22. 

25500 

C 

25600 

INCLUDE  'ELEMNT .FOR ' 

25700 

COMPLEX  Y2PI<IQT.IQT).YN(3»3> 

25800 

INTEGER  A2PLSP  < 50  ) . A2MNSF’(50 ) 

25900 

INTEGER  AIF'LSF’  ( 50  > , A1MNSP  ( 50  ) 

26000 

COMPLEX  U,C<50, 50) »D< 50.50) .E (50.50) 

26100 

-c 

- • — — 

26200 

DO  100  1=1. IB 

26300 

11  = A2PLSP(I> 

26400 

12  = A2MNSF'(  I ) 

26500 

DO  100  J=1 » IB 

26600 

D(I.J)  = (0..0.) 

26700 

E(I.J)  = <0.,0.> 

26800 

J1  = A2PLSP(J> 

26900 

J2  = A2MNSPC J) 

27000 

U = <0. ,0. > 

27100 

IF  ( I1*J1  .NE.  0)  U = U + Y2PKI1.J1) 

27200 

IF  < I2*J2  .HE.  0)  U = U + Y2PKI2.J2) 

27300 

IF(  1 1*J2  .NE.  0)  W = U - Y2PKI1.J2) 

27400 

IF  ( I2*J1  .NE.  0)  U = U - Y2PKI2.J1) 

27500 

100 

C(I.J>  = U 

27600 

c 

27700 

DO  5 1=1. IB 

27800 

DO  5 L=1 . I YB 

27900 

K = YBROW(L) 

23000 

J = YBCOLCL) 

28100 

5 

D(I.J)  = D(I.J)  - C(I,K)*YB(L) 

28200 

C 

28300 

DO  10  K=1 » IB 

28400 

10 

D(K.K)  = D(K.K)  + <1.,0.) 

28500 

C 

28600 

DO  15  J=1 , IB 

28700 

DO  15  L=1 » IYB 

28800 

I = YBROU(L) 

28900 

K = YBCOL(L) 

29000 

15 

E( I . J)  = E<I.J)  + YB(L)*D<N.J) 

29100 

C 

29200 

DO  20  1=1. IPP 

29300 

DO  20  J=1»IPP 

29400 

20 

YN(I.J)  = (0..0.) 

29500 

C 

29600 

c 

COMPUTE  A1  * E * AIT.  A1  IS  STORED  SPARSELY. 

29700 

c 

29800 

DO  30  L=1 . IB 

29900 

IP  = AIPLSP(L) 

30000 

IM  = AIMNSP(L) 

30100 

DO  30  K=1 . IB 

30200 

JP  = AIFLSP(K) 

30300 

JM  = AIMNSP(K) 

30400 

IF(IP*JP  .NE.  0)  YN(IP.JP)  = YN<IP,JP>  + E(L.K) 

30500 

IF(IM*JM  .NE.  0)  YN ( IM, JM)  = YN(IM.JM)  + E(L.K> 

30600 

IF < IF'#JM  .NE.  0)  YN(IP.JM)  = YN(IP.JM)  - E(L.K) 

30700 

IF(IM*JP  .NE.  0)  YN(IM.JP)  = YN(IM.JP)  - ECL.N) 

30800 

30 

CONTINUE 

30900 

RETURN 

31000 

END 

31100 

31200 

31300 

31400 

31500 

31600 

31700 

31800 

31900 

32000 

32100 

32200 

32300 

32400 

32500 

32600 

32700 

32800 

32900 

33000 

33100 

33200 

33300 

33400 

33500 

33600 

33700 


C 

C 

C 

C 

C 


C 

C 

c 


10 
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SUBROUTINE  Y2PMAT  < Y2P.  IQP.  IQF'T.  A2PLSP.  A2MNSP) 


THIS  SUBROUTINE  FORMS  EITHER  Y2  = A2  * YB  * A2T  OR 

Y2P  = A2P  * YB  * A2PT . SEE  REF • . EQ.<2).  SPARSE  MATRIX 

TECHNIQUES  HAVE  BEEN  USED  TO  STORE  YB. 


INCLUDE  'ELEMNT . FOR ' 

COMPLEX  Y2P ( IQPT » ICPT ) 

INTEGER  A2PLSP(50) » A2MNSP<50) 


INITIALIZE  Y2P  TO  0. 


DO  10  1=1. IQP 
DO  10  J=1 » IQP 
Y2P<I,J)  = (0..0.) 


COMPUTE  Y2P  = A2  # YB  * A2T 


DO  100  M=1 . I YB 
L = YBROW(M) 

K = YBCOL(M) 

IP  = A2PLSP<L> 
IM  = A2MNSPCL) 

JP  = A2PLSP(K> 
JM  = A2MNSP(K> 
IF(IP*JP  ,NE.  0) 
IF < IM#JM  .NE.  0) 


Y2PCIP. JP> 
Y2P(IM. JM) 


Y2P(IP» JF) 
Y2P(IM.JM) 


YB(M) 

YB(M) 


I 


33800 

IF<IP*JM  .NE.  0) 

Y2P< IP. JM) 

= Y2F'(IP.JM)  - YB(M> 

33900 

IF(IM#JP  .NE.  0) 

Y2P( IM. JP) 

= Y2F'(IM.JP>  - YB ( M > 

34000 

100  CONTINUE 

34100 

RETURN 

34200 

END 
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OOIOO 

SUBROUTINE  5NEU 

00110 

C 

00115 

c 

THIS  SUBROUTINE  FORMS  THE  INVERSE  OF  Y2P,  CALLED  Y2PI, 

00120 

c 

WITH  NODES  I AND  J SHORTED,  USING  EQ.<20>  IN  THE  REF. 

00125 

c 

SPARSE  MATRIX  TECHNIQUES  ARE  USED.  SEE  REF . , PP.  14-18. 

00130 

c 

00200 

INCLUDE  '0THER4 . FOR' 

00500 

COMPLEX  XK ( 16 » 16  > * XM  < 16),C(16),XI(16,16),W,  MC 

00700 

INTEGER  V< 17) , XIR0U(16) 

00720 

c 

00740 

c 

FORM  VECTOR  V.  SEE  REF . » P.  17 

00760 

c 

00900 

DO  10  K-1.17 

01000 

10 

V<K)  = 0 

01100 

IF  ( J . EQ . IP+1)  GO  TO  25 

01200 

DO  20  K=1 , J-IP-1 

01300 

20 

V ( K>  = K 

01400 

25 

IQ  = N-IP 

01500 

DO  30  K = J-IP, IQ-1 

01600 

30 

V(K>  = K+l 

01700 

V < IQ ) = J-IP 

01800 

C 

01820 

C 

FORM  MATRICES  K AND  Mr  DEFINED  IN  REF.,  EQ.C22).  SPARSE 

01840 

C 

MATRIX  TECHNIQUES  ARE  USED.  SEE  REF.r  EQ.<24>  AND  (25). 

01860 

C 

02200 

DO  50  IR- 1 r IQ 

02300 

DO  50  IS— 1 r IQ— 1 

02400 

W = Y2I(V(IR>  rV(IS) > 

02500 

IF < IR-IQ)  60,65,65 

02600 

60 

XK< IR, IS)  = W 

02700 

GO  TO  50 

02800 

65 

XM(IS)  = U-Y2I ( I-1P, V( IS) > 

02900 

50 

CONTINUE 

02920 

C 

02940 

C 

FORM  R22  * Y2  * U.  SEE  REF.,  EQ.(26>. 

02960 

C 

03000 

DO  70  M=1 , IQ-1 

03100 

70 

CCM)  = Y2(V(M) , J-IP) 

03400 

IF( I .GT.  IP)  C(I-IP)  = CCI-IP)  + Y2(J-IP,J-IP) 

03420 

C 

03440 

C 

FORM  1 - M * (R22  * Y2  * U>.  SEE  REF.,  EQ.<21). 

03460 

C 

03500 

MC  = <1.,0.) 

03600 

DO  80  M=1 , IQ-1 

03650 

IF(C(M)  .NE.  <0.,0.))  MC  = MC  - XM(M)*C(M) 

03750 

80 

CONTINUE 

03800 

C 

03810 

C 

FORM  XI  = C * XM  / MC.  SINCE  C HAS  MANY  ZERO  ELEMENTS, 

03820 

C 

XI  HAS  MANY  ZERO  ROWS.  ONLY  THE  NON-ZERO  ROWS  OF  XI  ARE 

03830 

C 

rORMED  BELOW.  SEE  REF.,  EQ.  <21 ). 

03840 

C 
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03850 

IROU  = 0 

03860 

DO  85  K=1 > IQ-1 

03870 

IF <C(K)  .EQ.  <0.f0.)>  GO  TO  85 

038S0 

IROU  = IROU  + 1 

03890 

XIROW(IROU)  = K 

03900 

DO  85  M=1.IQ-1 

03910 

XI(IROU.M)  = CCK)*XM<M>/MC 

03920 

85 

CONTINUE 

03930 

C 

03940 

c 

FORM  Y2PI  = XK  * (XI  + 1).  RECALL  THAT  ONLY  THE  NON- 

03950 

c 

ZERO  ROUS  OF  XI  HAVE  BEEN  SAVED.  SEE  REF . t EQ.<21). 

- 03960  - 

c - — 

— — ■ 

03970 

DO  95  11=1. IQ-1 

03980 

DO  95  Jl=l » IQ-1 

03990 

U = CO. .0.  ) 

04000 

DO  90  L=1 . IROU 

04010 

K = XIROU(L) 

04020 

90 

U = U + XKCI1.KMCXKL.J1) 

04030 

95 

Y2PKI1.J1)  = U + XK(Il.Jl) 

04500 

RETURN 

04600 

END 

52 
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OOIOO 

SUBROUTINE  MINVC < Bf  Cf  NNf  NT  > 

00200 

C 

00300 

C 

THIS  SUBROUTINE  INVERTS  THE  COMPLEX  VALUED  MATRIX  B. 

00400 

C 

IN  THE  CALLING  PROGRAM f B HAS  BEEN  DIMENSIONED  NT  X NT. 

00500 

C 

THE  ACTUAL  DIMENSION  USED  IN  THE  CALLING  PROGRAM  IS  NN  X NN 

00600 

C 

THE  INVERSE  OF  B IS  RETURNED  AS  Cf  WHICH  IS  DIMENSIONED 

00700 

C 

AS  B. 

00800 

C 

00900 

COMPLEX  ALPHA ( 17  > . BETA < 1 7 > f SUM  f SUMP  f U ( 1 7 ) t V < 1 7 ) , L AMBDA 

01000 

COMPLEX  A(17f17>fB(NTfNT) fC(NTfNT) 

01100 

DO  1 1=1 fNN 

01200 

DO  1 J=1fNN 

01300 

C<IfJ>  = (O.fO) 

01400 

1 

A(IfJ)  = B(IfJ> 

01500 

DO  3 1=1 fNN 

01600 

A(IfI)  = A(IfI)  - (I.fO.) 

01700 

3 

C<IfI>  = < 1 • f 0 . ) 

01800 

DO  40  N = IfNN 

01900 

DO  10  I = NUfNN 

02000 

ALPHA ( I ) = A(IfN)/A(NfN> 

02100 

10 

BETA(I)  = A(NfI) 

02200 

DO  17  1=1 fNN 

02300 

SUM  = (O.fO.) 

02400 

SUMP  = (O.fO.) 

02500 

DO  15  K=N.NN 

02600 

SUM  = SUM  + C(IfK)*A(KfN> 

02700 

• 

15 

SUMP  = SUMP  + A(NfIO*C(KfI> 

02800 

U(I>  = SUM 

02900 

17 

V(I)  = SUMP 

03000 

SUM  =.  (O.fO.) 

03100 

DO  21  J = N»NN 

03200 

21 

SUM  = SUM  + V(J)#A(JfN> 

03300 

LAMBDA  = SUM  + A(NfN) 

03400 

DO  30  I = IfNN 

03500 

DO  30  J = IfNN 

03600 

30 

C(IfJ)  = C(IfJ)  - U ( I > *V ( J ) /LAMBDA 

03700 

IF(N  .EQ.  NN)  RETURN 

03800 

DO  38  I = N+IfNN 

03900 

DO  38  J = N+IfNN 

04000 

38 

A(IfJ)  = A(IfJ)  - ALPHA(I)*BETA( J) 

04100 

40 

CONTINUE 

04200 

END 
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ELECTRICAL  PROPERTIES  OF  MAGNESIUM 
AND  GERMANIUM  IMPLANTED  GALLIUM  ARSENIDE 

ABSTRACT 


Preliminary  results  have  been  obtained  in  a Hall/sheet  resis- 
tivity study  of  the  electrical  properties  of  Germanium  (Gej- 
and  Magnesium  (Mg)-implanted  Gallium  Arsenide  (GaAsJ.  Germa- 
nium samples  implanted  with  120  KeV  ions  at  room  temperature 
and  in  six  doses  ranging  from  5E12  to  1E15  ions/cnr  were  found 
to  produce  p-type  activity  after  encapsulation  with  silicon 
nitride  and  anneal  for  15  minutes  at  900°C.  Activation  effic- 
iencies over  25%  and  mobilities  between  100  and  200  cnr/V-sec 
were  determined.  At  this  time  optimum  activation  and  mobility 
cannot  be  linked  with  confidence  to  implantation  and  anneal- 
ing conditions.  Although  Ge-implanted  GaAs  is  known  to  be 
amphoteric,  p-type  activity  has  not  been  previously  reported 
to  our  knowledge.  Concentration  profiles  of  Mg-implanted  sam- 
ples as  a function  of  depth  were  measured  using  Hall/sheet 
resistivity  measurements  and  a chemical  etchant  to  strip  away 
measured  uniform  surface  layers.  Because  of  the  rapid  diffu- 
sion of  Mg  ions,  encapsulation  was  with  silicon  dioxide,  tak- 
ing advantage  of  its  low  deposition  temperature  of  325°C  com- 
pared to  700°C  for  silicon  nitride.  By  varying  anneal  times 
at  a given  temperature,  the  measured  diffusion  profiles  permit 
a determination  of  diffusion  coefficients  for  Mg  ions  of  vary- 
ing doses  in  GaAs  by  fitting  with  a Gaussian  distribution  which 
includes  diffusion  rate  and  time.  All  encapsulations  were  done 
in  a pyrolytic  reactor. 


Frank  L.  Pedrotti 
Marquette  University 
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This  research  project  involved  the  characterization 
of  ion-implanted  gallium  arsenide  by  electrical 
measurements.  Ion  species  investigated  include  both 
Germanium  and  Magnesium.  The  research  accomplished 
during  the  ten  week  period  represents  only  a begin- 
ning to  the  work  which  is  being  planned  along  these 
lines  by  this  laboratory.  In  accordance  with  the 
goals  of  this  USAF  sponsored  program,  some  of  this 
work  will  be  continued  at  Marquette  University  in 
collaboration  with  scientists  of  this  laboratory. 


This  work  was  supported  py  the  Air  Force  Office  of  Scien- 
tific Research  through  the  USAF-ASEE  Summer  Faculty  Research 
Program  (WPAFB),  contract  F44620-/6-C-0052,  the  Ohio  State 
University,  Columbus,  uhio. 


INTRODUCTION 

Gallium  Arsenide  (GaAs)  has  been  the  most  thoroughly 
studied  among  compound  semiconductors.  Some  of  this  work 
has  led  to  the  fabrication  of  laser  and  LED  devices,  micro- 
wave  elements  including  IMPATT  diodes  and  Gunn  diodes,  and 
others.  There  are  also  promising  applications  in  the  field 
of  integrated  optoelectronics. 

There  are  several  reasons  for  the  use  of  ion-implanta- 
tion as  a means  of  doping.  Especially  in  the  case  of  donor 
impurities  which  have  low  diffusion  coefficients,  thermal 
equilibrium  doping  techniques  require  that  the  sample  be 
subjected  to  high  temperature  processing  which  leads  to  com- 
positional changes  due  to  thermal  instability  of  the  com- 
pound. Also  the  material  tends  to,  resist  type  conversion 
due  to  autocompensation  by  intrinsic  lattice  defects.  The 
technique  of  ion  implantation,  in  which  a beam  of  dopant 
ions  is  energetically  driven  into  the  substrate  surface, 
offers  the  possibility  of  avoiding  these  difficulties  be- 
cause it  is  essentially  a non-equilibrium  process.  However, 
this  technique  produces  some  radiation  damage  of  the  lat- 
tice which  must  be  annealed  away,  again  requiring  elevated 
temperatures  and  the  possibility  of  the  recurrence  of  the 
problems  mentioned  above.  When  such  effects  can  be  kept 
small,  however,  by  the  techniques  to  be  discussed  presently, 
ion  implantation  offers  a means  of  controlled  doping  of 
crystals  for  the  fabrication  of  a variety  of  useful  devices. 

In  the  process  of  implantation,  the  ions  must  give  up 
their  kinetic  energies  to  the  host  lattice,  resulting  in 
radiation  damage.  The  lattice  defects  so  introduced  must 
then  be  annealed  out  by  appropriate  thermal  treatment. 

Often  such  defects  have  disassociation  energies  high  enough 
so  as  to  require  annealing  temperatures  close  to  the  melting 
point,  or  in  the  neighborhood  of  900-1000  °C.  However,  anneal- 
ing of  the  GaAs  at  such  temperatures  leads  to  disassociation 
and  out-diffusion  of  Ga  and  As,  as  well  as  the  implanted  dopant 


and  Chromium,  when  it  is  used  as  a substrate  dopant  to  produce 
high  resistivity.  An  effective  means  to  prevent  such  out- 
diffusion  appears  to  be  a capping  of  the  implanted  sample  with 
an  appropriate  material.  The  cap  then  functions  as  a barrier 
to  diffusion,  protecting  the  sample  during  the  high  temperature 
anneal.  Improper  encapsulation  leads  to  low  activation  of 
the  implanted  species  and  anomalous  carrier  concentration 
changes  in  the  substrate  material,  masking  the  effects  of 
the  implantation.  A thin  film  barrier  that  functions  in  this 
way  must  itself  be  mechanically  stable  and  able  to  withstand 
high  temperatures  without  blistering  or  loss  of  adhesion. 
Because  metallic  films  can  produce  serious  i ntermetal 1 i c 
diffusion  with  the  semiconductor,  dielectrics  have  usually  been 
used.  Both  silicon  dioxide  ( S i 0 2 ) and  silicon  nitride  (Si  3 N 4 ) 
have  been  used  as  encapsul ants , although  the  latter  is 
favored  for  n-type  dopants  because  it  has  been  shown  that 
S i 0 2 does  not  effectively  prevent  outdiffusion  of  gallium. 

The  presence  of  gallium  vacancies  may  actually  facilitate  the 
substitutional  location  of  p-type  dopants.  Even  when  succes- 
sful encapsulation  is  achieved,  --that  is,  when  the  cap  re- 
mains integral  during  the  anneal  and  effectively  prevents  out- 
diffusion --  the  anneal  itself  may  cause  changes  in  the  sub- 
strate. Such  changes  may  include  redistribution  of  chromium, 
diffusion  of  the  implanted  dopant  and  diffusion  of  defects. 

The  effect  of  the  annealing  must  then  be  well  known  and  taken 
into  consideration  in  the  use  of  this  technique  to  fabri- 
cate device  materials  with  specific  properties. 

The  ability  to  produce  doped  semiconductor  materials 
with  properties  tailored  to  a specific  application  requires 
a clear  understanding  of  the  final  depth  distribution  of  the 
imbedded  ions  (the  profile),  as  well  as  the  conditions  under 
which  these  ions  will  move  into  substitutional  sites  in  the 
host  lattice  to  provide  free  carriers  (activation)  of  reason- 
able mobility.  The  ideal  of  100%  activation  efficiency 
occurs  when  the  implanted  atoms  occupy  substitutional  sites. 


freely  ionized,  and  not  compensated.  The  highest  mobil- 
ities are  achieved  under  these  circumstances. 

A wide  range  of  experimental  techniques  have  been  util- 
ized in  order  to  determine  the  relevant  properties  (charac- 
terization) of  ion  implanted  semiconductor  materials.  These 
methods  include  photoluminescence,  cathodol umi nescence , glow 
discharge  optical  spectroscopy.  Hall  effect,  capacitance-voltage 
measurements,  proton-bombardment-induced  x-ray  analysis.  Auger 
analysis,  el  1 i psometry , and  others.  The  work  being  reported 
here  involves  characterization  by  the  use  of  Hall  effect/ 
sheet  resistivity  measurements  in  conjunction  with  repeated 
removal  of  successive  surface  layers  of  the  implanted  sample 
by  chemical  etching  in  order  to  determine  a profile  of  the 
implanted  species.  The  work  has  concentrated  on  a study  of 
carrier  concentration,  activation  efficiency,  and  carrier 
mobility  of  the  implanted  sample  as  a function  of  implanted 
dose,  anneal  temperature  and  time,  and  depth  penetration. 


EXPERIMENTAL  PROCEDURES 

Sample  Preparation 

The  procedure  outlined  in  the  following  describes  the 
treatment  of  all  samples  studied,  from  the  cutting  of  the  sub- 
strate to  final  measurements. 

Substrate  material  is  received  in  the  form  of  thin  wafers 
about  .020  inch  as  sliced  from  the  boule.  These  wafers 
were  <100>-  oriented  GaAs  single  crystals  made  semi i nsul ati ng 
by  doping  with  chromium,  and  obtained  from  Crystal  Specialties, 
Inc.  The  wafer  is  cleaved  after  scribing  with  a diamond 
stylus  into  .2  x .2  inch  squares.  Each  sample  is  then  cleaned 
while  anchored  into  position  over  a small  hole  providing 
suction.  The  cleaning  sequence  consisted  of  several  solvents 
directed  in  turn  onto  the  sample  surface  from  squeeze  bottles 
for  10-20  seconds  each:  10%  aquasol  solution,  deionized  water, 
trichloroethylene,  acetone,  and  methanol.  The  sample  was 
blow-dried  with  a jet  of  nitrogen  gas  after  the  water  and 
methanol  wash.  Each  sample  was  subsequently  free-etched  in 
a teflon  beaker  by  submerging  in  a solution  of  H2SO4 : 30%  H2O2: 
H2O  in  a ratio  of  3:1:1  by  volume  for  90  seconds.  The  clean- 
ing and  etching  were  done  immediately  before  ion  implantation. 

Ion  implantation  of  the  samples  followed.  An  ion  current 
of  approximately  1 microamp  and  energy  of  120  KeV  was  directed 
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at  the  sample  until  doses  ranging  from  3 X 10  to  1 X 10 
2 

ions/cm  were  absorbed.  The  installation  includes  a means 
of  suppressing  secondary  electron  emission  in  order  to  de- 
termine total  doses  accurately  from  ion  current  measurements. 
The  samples  were  implanted  at  room  temperature  with  the  beam 
from  a hot  cathode  source,  directed  6-10  degrees  off  from  the 
'100>  crystal  1 i ne  direction  in  order  to  minimize  ion  channeling. 
The  system  incorporates  a mass  spectrometer  by  which  impurity 
ions  are  separated  from  that  portion  of  the  ion  beam  strik- 
ing the  samples. 
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Following  implantation,  the  samples  were  cleaned  again  in 
the  same  way  prior  to  encapsulation.  The  protective  film 
deposited  on  the  surface  was  either  silicon  nitride  or  silicon 
dioxide,  depending  on  the  tests  being  run.  In  either  case, 
the  samples  are  placed  in  a pyrolytic  reactor  where  the  requi- 
site gases  are  introduced  and  react  on  the  heated  GaAs  to 
produce  a film.  The  sample  is  heated  to  695  °C  in  about  five 
seconds  by  means  of  a carbon-strip  heater.  In  the  case  of 
silicon  nitride,  the  reacting  gases  are  ammonia  and  silane 
(SiH^),  each  diluted  in  nitrogen;  in  the  case  of  silicon 
dioxide,  they  are  silane  and  oxygen.  In  both  cases,  the  sys- 
tem is  first  purged  after  evacuation  with  purified  nitrogen 
gas  taken  from  a tank  of  liquid  nitrogen.  Caps  grown  in 
this  way  were  typically  1000  A thick,  requiring  a reaction 
time  of  about  50  seconds. 

The  capped  specimens  were  next  annealed  in  flowing 
hydrogen  gas  for  times  and  temperatures  determined  by  the 
investigations.  Temperatures  ranged  from  700°C  to  900°C, 
and  times  from  3 minutes  to  15  minutes.  The  caps  were  re- 
moved by  etching  in  48%  hydrofluoric  acid  for  2-3  minutes 
(Si  3 N 4 ) or  for  30  seconds  (Si02),  and  then  recleaned  with  a 
water  and  methanol  wash. 

Electrical  contacts  were  then  made  on  the  four  corners 
of  each  sample  using  indium  solder  applied  with  the  help 
of  an  ultrasonic  soldering  gun.  In  most  cases,  slight 
deviations  from  Ohmic  behavior  resulted.  Contacts  were 
therefore  rendered  Ohmic  by  annealing  again  at  low  temper- 
ature (300°C)  for  5 minutes  in  an  Argon  flow. 

Electrical  Measurements 

Hall  effect  and  sheet  resistivity  measurements  were  made 
using  the  standard  van  der  Pauw  technique.  In  order  to  handle 
high  resistivity  materials,  the  system  employed  utilizes 
high  impedance  electrometers  operated  as  unity-gain  ampli- 
fiers. The  output  drives  the  shields  on  the  leads  between 
amplifier  and  sample,  resulting  in  minimal  leakage  currents 


and  a low  system  time  constant. 

The  van  der  Pauw  technique  is  particularly  useful  since 
sample  dimensions  and  contact  separation  do  not  enter  into 
the  calculations.  All  that  is  required  are  four  contacts  any- 
where on  the  periphery  of  the  sample.  The  method  requires 
measurement  of  current  and  voltage  corresponding  to  different 
pairs  of  contacts.  These  pairs  must  then  be  interchanged 
and  the  measurements  averaged  in  order  to  correct  for 
geometry.  Measurements  are  also  averaged  in  each  case  for 
forward  and  reverse  sample  currents  and  magnet  currents  in 
order  to  cancel  stray  gal vanomagneti c and  thermomagneti c 
effects  which  can  give  rise  to  spurious  voltages  superimposed 
on  the  Hall  voltage  to  be  measured. 

From  these  measurements,  the  sheet-resistivity  p$(resis- 
tivity  per  unit  thickness)  and  the  sheet  Hall  coefficient  R^ 
(Hall  coefficient  per  unit  thickness)  are  determined  directly. 
Hall  mobility  y^  and  sheet  carrier  concentration  can  then 
be  calculated  from  the  relations: 


PH  = RHS/  ps  ’ Ns  = r/  eRHS 

where  r is  the  ratio  of  Hall  mobility  to  conductivity  mo- 
bility (usually  taken  as  unity)  and  e is  the  electronic  charge. 
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Ns  represents  the  sheet  carrier  concentration  (ions/cm  ). 

Repeated  measurements  can  be  made  on  the  surface  of  the 
sample  after  successive  removal  of  measured  layers  by  con- 
trolled etching.  Thin  layers  can  be  stripped  away  uniformly 
and  without  damage  using  an  etch  solution  of  HgSO^-.OOiS  H2O2: 

H2O  in  proportions  of  1:1:50.  At  this  concentration  a typi- 
cal etch  rate  is  about  200  A/minute  as  measured  by  a Sloan 
Dektak  surface  profile  measuring  system  on  a control  sample 
placed  alongside  the  measured  sample.  Contacts  and  leads 
were  protected  from  the  etchant  by  coating  with  black  wax 
dissolved  in  trichloroethylene. 
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The  carrier  concentrations  (per  unit  volume)  and  the 
mobilities  in  the  i-th  layer  can  then  be  obtained  from  the 
rel ati ons : 


where  A.  (Vfs)i 


and 


Here  (R^)^  and  ( p $ ) ^ are  the  measured  sheet  Hall  coeffi- 
cient and  sheet  resistivity  after  removal  of  the  i-th  layer 
of  thickness  d^ . 

The  magnetic  field  strength  used  was  7.6  Kgauss  through- 
out. Sample  currents  ranged  from  milliamps  to  fractions  of 
a microamp. 


DISCUSSION  AND  RESULTS 


I . Germanium-Implanted  GaAs 
Background  and  Rationale' 

Very  little  data  exists  on  Germanium  (Ge) -implanted 
GaAs.  The  only  article  since  1 9 7 6 appears  to  be  one  by  R.K. 
Surridge  and  B.J.  Sealy  of  the  University  of  Surrey  (J.  Phys. 
D:  Appl . Phys  XO  (1977)  ).  This  article  reports  n-type  acti- 
vity for  200  and  300  KeV  Ge  ions,  annealed  at  700°C  for  15 
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minutes  using  an  Aluminum  cap.  Doses  ranged  from  10  to  10 
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ions/cm  . In  particular,  for  a dose  of  2x10  ions/cm 

and  energy  200  KeV,  these  investigators  found  a sheet  mobility 

2 

of  about  3600  cm  /V-sec,  a sheet  carrier  concentration  of 
12  2 

5 x 10  /cm  , and  an  activation  efficiency  of  only  2.5%. 

Germanium,  like  Silicon,  is  an  amphoteric  dopant  in 
GaAs,  with  the  possibility  of  producing  both  n or  p type 
activity  depending  on  implant  and  heat  treatment  conditions. 

It  is  desirable  to  determine  the  experimental  factors  cri- 
tical to  each  behavior. 

Penetration  of  Ge-implanted  ions  into  GaAs  should  be 
relatively  shallow.  At  an  energy  of  12Q  KeV,  the  peak 
concentration  should  occur  at  only  500  A deep.  Thus  Ge 
implants  may  be  useful  in  providing  good,  shallow  n-type 
1 ayers . 

Ge  implants  also  provide  the  possibility  of  making  good 
ohmic  contacts  on  n-type  material.  The  most  common  method 


at  present  is  to  evaporate  a Au-Ge  (88:12)  alloy,  followed  by 
a Ni  layer,  and  then  the  formation  of  a contact  at  around  450°C. 
The  function  of  the  Nickel  is  to  reduce  the  cohesion  of  the 
Au-Ge  alloy,  preventi  ng  ~i  t from  "balling  up".  The  fast-dif- 
fusing Au  drives  the  Germanium  into  the  material  as  a donor. 

Ion  implantation  offers  the  alternate  possibility  of  imbed- 
ding the  Ge  ions  directly,  hopefully  making  them  substitutional 
at  a substrate  temperature  significantly  less  than  450°C. 


Resul ts 

The  presence  of  a Ge-implanted  layer  in  the  samples  used 
was  confirmed  by  the  method  of  glow  discharge  optical  spectros- 
copy (G.O.O.S.),  which  looks  at  the  luminescence  of  a glow 
discharge  while  sputtering  away  the  surface.  A Germanium  line 
was  seen  appearing  from  about  400-500  A deep  during  the  sput- 
tering. 


Germanium-implanted  samples  (120  KeV)  in  six  doses  of 
5xl012,  lxlO13,  3xl013,  IxlO14,  3xl014,  and  lxlO15  ions/cm2 
were  all  encapsulated  with  S i 4 and  annealed  for  15  minutes 
at  800°C . Results  indicated  that  this  anneal  temperature  was 
too  low  to  achieve  sufficient  activation.  In  the  samples 
where  activation  was  measurable,  activity  was  p-type  with 
efficiencies  of  less  than  8%. 

Another  set  of  samples  was  prepared  in  the  same  way  but 
annealed  at  900°C  for  15  minutes.  The  two  highest  doses 
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showed  considerable  cap  failure  during  anneal,  the  cap  sta- 
bility improving  inversely  with  the  dose  level.  Electrical 


measurements  made  on  the  four  lowest  doses  showed  p-type 
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activity  with  good  activation,  ranging  from  28%  at  1x10  /cm 
12  2 

to  90%  at  5x10  /cm  . Mobilities  ranged  from  50  to  182  in 
these  samples.  These  activations  and  mobilities  showed  so 
much  improvement  over  results  at  800°C  anneal  that  they  were 
suspect.  An  investigation  of  the  substrate  alone,  processed 
through  identical  capping  and  annealing  stages,  was  planned 
in  order  to  determine  whether  conducting  surface  layers  are 
produced  on  the  semi -i nsul ati ng  GaAs  alone. 

This  study  was  carried  out  on  substrate  samples  from 
two  different  boules.  Results  showed  p-type  material  with  a 
wide  variation  in  the  value  of  average  resistance  measured 
directly  across  the  corners  of  the  sample  surface,  ranging 
from  several  kilohms  to  megohms.  An  effort  to  re-examine  the 
operation  and  flow  rates  of  the  pyrolytic  reactor  was  under- 
taken, since  it  was  believed  that  defective  capping  was  allow- 
ing changes  in  the  substrate  through  out-diffusion  of  Arsenic, 
loss  of  Chromium,  or  some  other  effect.  If  the  p-type  sub- 
strate, as  a result,  has  an  apparent  sheet  concentration  com- 
parable to  or  greater  than  that  of  the  ion-implanted  layer, 
or  if  its  resistivity  is  comparable  to  or  less  than  that  of 
the  ion-implanted  layer,  then  meaningful  results  descriptive 
of  the  ion  layer  itself  cannot  be  found  by  electrical  measure- 
ments. When  the  caps  had  improved,  with  more  consistent  re- 
sistances in  the  megohm  range,  the  investigation  proceeded. 

It  was  felt  that  an  intermediate  anneal  temperature 
around  850°C  might  produce  sufficient  activation  of  the  ion 
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implants  and  at  the  same  time  prevent  excessive  deterioration 

of  the  substrate  itself.  The  set  of  doses  was  annealed  at 

850°C  and  measured.  Activation  efficiencies  were  found  to 

2 

range  from  7 to  40  %,  and  mobilities  from  10  to  144  cm  /V-sec. 
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The  3x10  /cm  dose  sample,  with  15%  activation  and  mobility 
of  113  was  selected  for  profiling.  The  profile  of  this  sample 
however  showed  a concentration  of  carriers  almost  flat  from 
the  surface  into  the  sample  to  a depth  of  around  1300  A.  If 
these  measurements  correspond  to  carriers  due  to  activated  Ge 
atoms,  then  they  would  indicate  considerable  diffusion  of  the 

. 

implanted  profile  during  the  capping  and  15  minute  anneal  stages. 

i . j 

A second  Ge-implanted  sample  annealed  at  900°C  was  also 

profiled,  but  showed  similar  behavior,  reasonably  flat  at 
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a concentration  of  about  3 x 10  /cm  , and  then  dropping  off 
at  a depth  of  about  1700  A. 

In  all  cases,  the  Ge-implanted  layers  showed  p-type 
activity,  unlike  the  results  of  Surridge  and  Sealy  quoted 
earlier.  At  this  point  it  is  not  possible  to  pinpoint  the 
critical  difference  in  preparation  leading  to  opposite  acti- 
vities. More  investigation  of  Germanium-implanted  GaAs  will 
be  necessary  before  the  relation  between  implant  and  anneal 
parameters  and  electrical  properties  can  be  described  with 
confidence.  At  this  stage,  the  data  collected  is  sometimes 
inconsistent  with  previous  data.  However,  the  data  described 
previously  was  found  to  be  generally  reproducible  in  samples 
tested  later.  Some  of  this  data  is  summarized  in  the  accom- 
panying Table.  Undoubtedly  the  variations  in  quality  and 
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3.1E6 

144 

2.0E12 

40% 

1 E 1 3 

P 

7.SE4 

1.7E6 

22 

3.8E12 

37% 

3E13 

P 

1.3E4 

1.4E6 

113 

4.3E12 

14% 

1 E 1 4 

P 

2.3E4 

4.9E5 

21 

1.3E13 

13% 

3E14 

P 

3.6E4 

3.2E5 

9 

2.UE13 

6.5% 

1 1 July  1978 
900°C  Anneal 

Dose:  3E13 

P 

4.5E3 

6.9E5 

154 

9.0E12 

30  % 

1 E 1 4 

P 

2 . 2E3 

2.4E5 

109 

2.6E13 

26  % 

26  July  1978 
yOO°C  Anneal 


1 E 1 3 

P 

4 . 5E4 

6.5E5 

14 

y.6E12 

96% 

3 E 1 3 

P 

2.6E4 

6.UE5 

23 

1.0E13 

35% 

1E14 

P 

2.6E3 

2.6E5 

101 

2.4E13 

24% 

3E14 

P 

7.7E3 

1.4E6 

182 

4.4E12 

1 . 5% 

consistency  of  the  caps,  and  in  the  substrate  material  itself, 
are  chiefly  responsible  for  discrepancies  in  measured  electrical 
properti es . 


II.  Magnesium-Implanted  GaAs 


Background  and  Rationale 


In  a study  of  Magnesium-implanted  GaAs  conducted  by  Dr. 
Y.K.  Yeo  of  this  laboratory,  it  was  determined  that  Magnesium 
ions  have  large  diffusion  coefficients,  producing  rapid 
changes  in  their  diffusion  profile  during  heat  treatment.  A 
portion  of  this  investigation  directed  to  the  determination  of 
the  magnitude  of  the  diffusion  coefficient  for  Magnesium  was 
undertaken  as  part  of  this  project  and  will  be  briefly  reported 
here . 

The  basic  theoretical  framework  which  predicts  the  con- 
centration distribution  with  depth  (profile)  of  implanted 
ions  is  due  to  the  unified  range  theory  of  Lindhard,  Scharff, 
and  Schiott  (LSS  theory).  This  theory  produces  for  a given 
ion,  substrate  material,  and  ion  dose  & energy,  values  of  the 
mean  projected  ion  range  R and  a standard  deviation  in  ion 
range  R$,  which  are  fitted  to  a Gaussian  distribution  of  the 


form: 


/v,  Do** 

^ ' -£F  Rs 


- a-  «i 


where  n is  ion  density  and  x is  distance  from  the  surface. 


Once  implanted,  ion  diffusion  occurs  which  alters  the 
theoretical  profile  to  a diffused  profile.  The  latter,  which 
depends  on  thermal  treatment  of  the  sample,  can  be  shown  to 
be  describable  analytically  by  replacing  R$  in  the  above  by 
the  quantity  + 2Dt  , where  D is  the  diffusion  coefficient 

p 

(function  of  temperature)  in  cm  /sec  and  t is  the  diffusion 
time.  This  solution  to  the  diffusion  equation  assumes  no 
surface  condition  affecting  out-diffusion  there. 

Thus  the  measured  concentration  profile  can  be  fitted  to 
a Gaussian  distribution  by  appropriate  selection  of  diffusion 
coefficient  D corresponding  to  anneal  time  t at  a given  anneal 
temperature.  A study  of  diffusion  profiles  for  a given  temp- 
erature but  different  times  then  provides  a method  of  deter- 
mining diffusion  coefficients  for  a particular  implanted  ion. 

In  the  case  of  the  Magnesium-implanted  ions.  Dr.  Yeo  had 
found  that  diffusion  was  so  rapid  that  considerable  change  in 
the  implanted  profile  already  occurs  during  the  short  heat 
treatment  involved  in  capping  alone.  With  S i 4 capping 
this  treatment  is  at  a temperature  of  695°C  for  a time  of 
about  45  seconds.  Because  of  this  preliminary  diffusion,  sub- 
sequent anneals  up  to  temperatures  of  700°C  were  found  to  make 
no  appreciable  alteration  in  the  electrical  behavior  of  the 
samples.  The  present  study  endeavored  to  use  a S i 0 2 caP  which 
could  be  put  down  at  a much  lower  temperature  of  325°C  in  order 
to  permit  a diffusion  rate  study  at  temperatures  lower  than 
700°C.  Since  Magnesium  implanted  GaAs  is  p-type,  the  SiO^ 
cap  would  not  present  any  difficulties,  especially  considering 
the  low  anneal  temperatures  planned.  Diffusion  behavior  at 


higher  temperatures  was  already  available  from  Dr.  Yeo's 
measurements . 

Resul ts 

This  effort  was  only  begun  in  the  time  available.  Two 
doses  of  Mg-impianted  GaAs  were  capped  with  about  1 <;00  a of 
biO.,  and  then  annealed  at  three  different  anneal  times  at  a 
temperature  of  7U0°C. 

Each  sample  was  to  be  profiled  and  compared  to  results 
for  the  same  dose  and  anneal  temperature  but  a different  an- 
neal time  in  order  to  determine  the  diffusion  coefficient, 
bince  a single  profile  requires  a series  of  Ha  1 1 /res i s ti vi ty 
measurements  occupying  a full  day's  time,  only  a few  of 
these  profiles  could  be  made.  The  results  of  the  profiles 
to  date  show  good  agreement  with  the  previous  measurements  of 
Dr.  Yeo. 

In  continuing  this  effort  further,  several  future  lines 
of  investigation  have  been  suggested  by  the  work  to  date.  A 
continuation  of  the  diffusion  study  for  Mg-implanted  ions 
should  include  additional  profiling  with  silicon  nitride  caps 
at  other  temperatures,  especially  at  700°C  anneal. This  will 
permit  a better  understanding  of  the  effect  of  the  capping 
material  on  diffusion  profiles.  In  addition,  the  effect  on 
the  profiles  of  the  Ohmic  anneal  itself  should  be  system- 
atically studied. 


